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SEQ IP NO: 


ACCESSION 
WO. 


DESCRIPTION 


RESULTS* 








17.27 4.462e-ll 47-70 
PR00449D 10.79 7.120e- 
11 109-123 


235 


PR00019 


laUCIWE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 7.3D0e- 
10 251-265 PR00019B 
11.36 5.320e-09 119- 
133 PR00019B 11.36 
l-000e-08 229-243 


236 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 7.300c* 
10 245-259 PR00019B 
11.36 5.320e-09 113- 
127 PR00019B 11.36 
1.000e-08 223-237 


237 


PD00289 


PROTEIN SH3 DOMAIN 
PJSP3AT PRESYNA. 


PD00289 9.97 B.448e-09 
67-81 


240 


PRO 0011 


TYPE III EGF-LIKE 
SIGNATURE 


PROOOllD 14.03 3.492e- 
10 616-635 


241 


PRO 0011 


TYPE III EGF-LIKE 
SIGNATURE 


PROOOllD 14.03 3.492e- 
10 616-635 


244 


BL00903 


Cytidine and 
deoxycy t idyla t e 
deaminases zinc -binding 
region s. 


BL00903 12.93 8 . 941e- 
12 54-64 


245 


DM00179 


w KINASE ALPHA ADHESION 
T-CBLI*. 


DM00179 13-97 8.043e- 
09 124-134 


248 


BL00246 


Wnt-l family proteins. 


BL00246D 23.97 l.OOOe- 
40 186-239 BL00246E 
20.32 1.000e-40 305- 
351 BL00246B 13.69 
4.176e-36 105-140 
BL00246A 15.75 2.286e- 
24 70-90 BL00246C 
15.56 4.857e-22 150- 
175 


250 


PRO 092 7 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1 SIGNATURE 


PR00927E 14.93 5.114e- 
10 253-275 


254 


BL00674 


AAA-protein family 
proteins . 


BL00674B 4.46 l.OOOe- 
09 223-245 


255 


PD01796 


PROTEIN TRANSMEMBRANE 
COBALT ZINC CADMIU. 


PDD1796 15.01 6.045e- 
09 61-88 


255 


BL50002 


Src homology 3 (SH3) 
domain proteins profile. 


BL50002B 15.18 2.800e- 
10 421-435 


258 


PR00094 


ADENYLATE KINASE 
SIGNATURE 


PR00094C 12.94 2.200e- 
18 87-104 PR00094D 
12.52 2.731e-14 161- 
177 PR00094A 10.31 
S.500e~14 11-25 
PRO0O94B 11.01 4.115e- 
13 39-54 PR00094E 
11.25 7.333e-13 178- 
193 


259 


BL00892 


HIT family proteins. 


BL00892A 18.17 5.500e- 
13 60-91 


262 


BL00388 


Proteasome A- type 
subunits proteins. 


BL00388A 23.14 l.OOOe- 
40 B-54 BL0038BB 
31.38 3.864e-33 66-108 
BL00388D 20.71 l.OOOe- 

jZX xbJ-lo4 dJjUUJOOC 

18.79 8.147e-16 126- 
148 


264 


BL00903 


Cytidine and 
deoxycyt idy la t e 
deaminases zinc -binding 
region 3. 


BL00903 12.93 5.B21e- 
09 91-101 


267 


BL00107 


Protein kinases ATP- 
binding region proteins - 


BL00107B 13.31 1.529e- 
09 241-257 


270 


BL00226 


Intermediate filaments 
proteins. 


BL00226D 19.10 l.OOOe- 
37 362-409 BL00226B 
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ACCESSION 
NO. 
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RESULTS* 








23.86 6.043e-35 196- 
244 BL00226C 13.23 
7.000e-20 261-292 
BL00226A 12.77 6.143e- 
15 96-111 


271 


PD02952 


KINASE TRANSFERASE 
CHOLINE PROTEIN 
MULTIGENE FAMI. 


PD029S2C 15.76 9.731e- 
16 235-265 PD02952B 
15.57 5.625e-09 215- 
229 


272 


PD02929 


ADHESION GLYCOPROTEIN 
PRECURSOR I. 


PD02929A 28.27 l.OOOe- 
40 106-160 PD02929B 
18.36 8.800e-17 179- 
199 


274 


BL01O27 


Glycosyl hydrolases 
family 39 proteins. 


BL01027B 15.34 3.486e- 
09 213-250 


275 


PRO 0424 


ADENOSINE RECEPTOR 
SIGNATURE 


PR00424D 14.32 6.451e- 
11 39-59 


277 


BL00 052 


Ribosomal protein S7 


BL00052A 27.85 6.000e- 

15.17 S.143e-12 208- 
235 


279 


BL00790 


Receptor tyrosine kinase 
class V proteins. 


BLO0790N 13.25 5.659e- 
13 267-294 


280 


PR00319 


BETA G- PROTEIN 
ITRANSDUCIN) SIGNATURE 


PR00319D 11.64 6.625e- 
23 107-125 PR00319C 
13.41 1.000e-21 89-105 
PR00319A 15.27 8.364e- 
21 51-68 PR00319B 
11.4/ B.200e-19 70-o5 


281 


PRO 03 19 


BETA G- PROTEIN 


PR00319D 11.64 6.625e- 

13.41 1.000e-21 76-92 

21 38-55 PR00319B 
11.47 8.200e-19 57-72 


2B7 


PF00929 


Exonuclease . 


PF00929D 16.17 7.3S6e- 
09 149-163 


291 


BL00326 


Tropomyosins proteins. 


BL00326A 14.01 2.360e- 
09 93-127 


292 


BL00326 


Tropomyosins proteins. 


BL00326A 14.01 2.360e- 
09 93-127 


294 


PDO0066 


PROTEIN ZINC- FINGER 
METAL - BIND I . 


PD00066 13.92 8.714e- 
12 203-216 


295 


BL00028 


Zinc finger, C2H2 type, 
domain proteins. 


BL00028 16.07 5.500e- 
15 322-339 BL00028 
16.07 9.471e-14 433- 
450 BL00028 16.07 
4.600e-13 648-665 
BL0002S 16.07 5.500e- 
13 76D-777 BL00028 
16.07 9.550e-13 78S- 
805 BL00028 16.07 
3.348e-12 704-721 
BL00028 16.07 6.478e- 
12 461-478 BL0002Q 
16.07 8.435e-12 844- 
861 BL00028 16.07 
1.692e-ll 593-610 
BL0002B 16.07 2.038e- 
11 211-228 BL00028 
16.07 S.154e-ll 732- 
749 BL00028 16.07 
5.846e-ll 377-394 
BL00028 16.07 6.885e- 
11 816-833 BL00028 
16.07 7.231e-ll 676- 
693 BL00028 16.07 
9.654e-ll 564-581 
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SBQ ID WO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








BL00028 16.07 4.086e- 

16.07 7_429e-09 489- 
506 


296 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 8.333e- 
16 111-136 BL00215A 

BL00215B 10.44 9.526e- 
11 152-165 BL00215B 
10.44 7.375e-10 59-72 
BL00215A 15.82 9.824e- 
10 205-230 


302 


PF00953 


Glycosyl transferase. 


PF00953C 19,70 8.773e- 
34 236~269 Pr009D3A 
19.68 5.000e-25 102- 
129 , PF00953B 6.17 
i.000e-13 182-194 | 


304 


PF00152 


tRNA synthetases class 

II . ; 


PF00152D 21.30 8.364e- 

28.03 9.250e-21 220- 
257 PF00152B 15.67 
2.6S8e-13 1S9-184 
PF00152A 19.68 5.714e- 
11 44-67 


305 


PD01066 


PROTEIN 2INC FIKGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 8.250e- 
35 37-76 


306 


PD02784 


PROTEIN NUCLEAR 
RIBONUCLBOPROTEIN . 


PD02784B 26.46 5-840e~ 
09 92-135 


307 


PRO 04 54 


ETs "DOMAitN SIGNATURE 


PR00454C 11.24 7.808e- 
09 1167-1186 


308 


PRO 02 3 7 


RHODOPSIN-LIKB GPCR 
SUPERFAMILY SIGNATURE 


PR00237E 13.03 5.091e- 
13 188-212 PR00237G 
19.63 7.207e-13 268- 
295 PR00237A 11.48 
4.375e-ll 24-49 
PR00237C 15.69 3.057e- 
10 101-124 PR00237D 
8.94 4.750e-10 137-159 
PR00237P 13.57 5.364e- 
10 230-255 PRO0237B 
13.50 9.438e-10 57-79 


309 


BL00522 


DNA polymerase family X 
proteins. 


BL00522C 11.90 7.5>77e- 
24 315-339 BL00522P 
14.90 I.310e-15 470- 
494 BL00522A 25.52 
X . Zbr»e- JL1 

BL00522E"19.63 8.615e- 
14 430-460 BL0052.2B 

313 


no 






BL00126D ft 76 5.235e- 
10 656-897 


312 




major histocompatibility 
complex proteins. 


BL0029QA 20.89 4.706e- 
14 151-174 BL00290B 
13.17 9.000e-12 211- 
229 


313 


BLOODS 


Ets- domain proteins. 


BL00345B 21.28 l.OOOe- 
40 34-85 BL00345A 
13.96 9.2l7e-16 1-20 


315 


PFO0651 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PF00651 15.00 5.091e- 
15 63-76 


317 


BL01020 


SARI family proteins. 


BL01020C 15.35 3.19Be- 
17 79-130 


318 


BL00216 


Sugar transport 
proteins . 


BL00216B 27.64 4.696e- 
11 164-214 


320 


PR00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 


PR00109B 12.27 4 . 8l4e- 
10 216-235 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






SIGNATURE 




321 


BL00027 


'Homeobox' domain 
proteins . 


BL00027 26.43 5.688e- 
10 329-372 


322 


PR00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 
SIGNATURE 


PR00109B 12.27 8.765e- 
12 558-577 


324 


BL01241 


Link domain proteins. 


BL01241 35.81 8.313e- 
30 183-236 BL01241 
35.81 3.222C- 13 282- 
335 


326 


BL00412 


Neuromodulin (GAP-43) 
proteins. 


BL00412D 16.54 4.000e- 
12 515-566 BL00412D 
16.54 5.705e-ll 516- 
567 BL00412D 16.54 
7.848e-10 518-569 
BL00412D 16.54 1.827e- 
09 514-565 BL00412D 
16.54 1.918e-09 513- 
564 BL00412D 16.54 
2.102e-09 520-571 


328 


BL00232 


Cadherins extracellular 
repeat proteins domain 
proteins . 


BL00232B 32.79 9.557e- 
20 151-199 BL00232B 
32.79 2.246e-18 41-89 
BL00232B 32.79 5.985e- 

32.79 5.500e-16 258- 

JUb DIM If A j Aa SZ . #1# 

9.3B4e^l5 475-523 

HT nftTlOf 1 ft CC O CT7*» 
DLUL)y..izL 1_U - b~> A . bJ / C w 

12 256-274 BL00232C 

i ft f r~ /t ii ^ r o 
lU.bb 4. Jibe-11 ibo- 

386 BL00232C 10.65 
7.261e-ll 473-491 
BL00232C 10.65 7.457e- 
11 39-57 


330 


PR004S4 


KTS DOMAIN SIGNATURE 


PR0b45^4d 11.24 7.808e- 
09 1167-1186 


331 


BL00598 


Chromo domain proteins. 


BL00598 14.45 8.393e- 
18 27-49 


333 


BL01016 


Glycoprotease family 
proteins . 


BL01016C 22.84 3 . 925e- 
32 70-115 BL01016E 


* 






XI.OO 3.ZDDC J-il XtH* 








177 BL01016H 13.71 
7 l 577f*-13 291-301 
BL01016D 8.86 3.298e- 
11 127-140 BL01016G 
7.14 5.622e-10 261-271 
BL01016A 5.65 7.167e- 
10 4-19 BL01016P 
13.34 1.563e-09 200- 
212 BL01016B 8.93 
8.855e-09 38-50 


339 


BL01115 


GTP- binding nuclear 
protein ran proteins. 


BL01115A 10.22 5.500e- 
11 17-61 


340 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 1.231e- 
33 10-49 


341 


BL01160 


Kineein light chain 
repeat proteins. 


BL0116OB 19.54 5.042e- 
09 55-109 


342 


PD01066 


PROTEIN ZINC FINGER 
Z INC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 2.400e- 
30 16-55 


343 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031A 16.80 l.OOOe- 
40 20-68 


346 


PR00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 
SIGNATURE 


PR00109B 12-27 4.764e- 
11 135-154 


347 


PR00109 


TYROSINE KINASE 


PR00109B 12.27 4.764e- 
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ACCESSION 
NO. 
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RESULTS* 






CATALYTIC DOMAIN 
SIGNATURE 


11 135-154 


351 


BL01187 


Calcium- binding EGF-like 
domain proteins pattern 
proteins . 


BL01187B 12.04 1.783e- 
13 100-116 BL011B7B 
12.04 8.435e-13 276- 
292 BL011B7B 12.04 
8.e00e-ll 13-29 
BL01187B 12.04 7.429e- 
10 54-70 BL01187B 
12.04 5.725e-09 231- 
247 BL01187A 9.98 
7,000e-09 255-267 


352 


PD00078 : 


REPEAT PROTEIN ANK 
NUCLEAR ANKYR. 


PD00078B 13.14 5.950e- 
10 366-379 PD00078B 
13.14 4.522e-09 168- 
181 


354 


BL0Q380 


Rhodanese proteins. 


BL00380F 9.76 6.694e- 
11 542-553 


355 


PF00628 


PHD- finger. 


PF00628 15.84 l.GGOe- 
11 116-131 


356 


PR005B7 


SOMATOSTATIN RECEPTOR 
TYPE 1 SIGNATURE 


PR00587A 8.06 9.700e- 
09 17-37 


359 


PD00066 


PROTEIN ZINC- FINGER 
METAL- BIND I . 


PD00066 13.92 4.462e- 
15 261-274 PD0006S 
13.92 6.500e-13 233- 
246 PD00066 13.92 i 
4.300e-09 289-302 


361 


PF00791 


Domain present in ZO-1 
and Unc5-like netrin 
receptors . 


PFQ0791B 28.49 9.604e- 
13 54-109 PF00791B 
28.49 1.095e-12 21-76 
PF00791A 27.85 1.432e~ 
09 71-126 PF00791B . 
28.49 7.440e-09 184- 
239 


362 


PF00791 


Domain present in ZO-1 
and Unc5-like netrin 
receptors . 


PF00791B 28,49 2.273e- 
11 279-334 


363 


PR00450 


RECOVERIN FAMILY 
SIGNATURE 


PR00450C 12.22 5.080e- 
10 73-95 PR00450C 
12.22 3.27Be-09 109- 
131 


364 


PF00242 


DNA polymerase (viral) 
N- terminal domain 
proteins . 


PF00242Q 13.51 2.328e- 
09 22-68 


3 65 


PF00242 


DNA polymerase (viral} 
N- terminal domain 
proteins . 


PF00242Q 13.51 2.328e- 
09 22-68 


366 


BLD1160 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 6.644e- 
09 1038-1092 


367 


PRO 0019 


LEUCINE -RICH REPEAT 
SIGNATURE 


PR00019B 11.36 1.360e- 
09 229-243 PR00019B 
11.36 6.040e-09 91-105 
PR00019A 11.19 8.667e- 
09 370-384 


368 


PRO 0011 


TYPE III EGF-LIKE 
SIGNATURE 


PR00011D 14.03 9-QO0e- 
15 30-49 PR00011A 
14.06 9.830e-15 30-49 

PRUQ011B IJ.Oo 4.500e- 
14 30-49 PR00011C ! 
24.25 5.143e-09 6-35 


369 


BL01032 


Protein phosphatase 2C 
proteins . 


BL01032H 11.25 4.150e- 
09 417-430 


372 


BL00478 J 


LIM domain proteins. 


BL00478B 14.79 7.750e- 
12 410-425 


373 


PD01066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 9.757e- 
34 26-65 


376 


PRO 0170 


SODIUM CHANNEL SIGNATURE 


PR00170E 6.48 2.739e- 
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NO. 












10 88-118 


380 


BL00107 


Protein kinases ATP- 


BL00107A 18.39 l.OOOe- 

Z. j£/0"JUJ J3JU uuiu »n 

13.31 1.692e-12 342- 1 
358 


381 


BL00455 


Putative AMP-binding 


BL00455 13.31 5.714e- 
12 50-66 


382 


PR00624 


HI SCONE H5 SIGNATURE 


PR00624G 4-08 4.900e- 


384 


PD00078 


REPEAT PROTEIN ANK 
NUCLEAR ANKYR. 


PD00078B 13.14 5.950e- 
10 366-379 PD00078B 

181 


385 


PRO 05 11 


TEKTIN SIGNATURE 


PR00511D 7.11 5.371e- 
09 67-80 


386 


PD02870 


RECEPTOR INTRRT.EUKIN-1 
PRECURSOR. 


PD02870B 18.83 6.000e- 
10 97-130 


383 


PD00066 


PROTEIN ZINC- FINGER 
METAL- BINDI . 


PD00066 13.92 5.000e- 
13 516-529 


389 


BL00290 


Immunoglobulins and 
major histocompatibility 
complex proteins. 


BL00290A 20.89 7.657e- 
09 151-174 


390 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 5.2D0e- 
15 221-246 BL00215A 
15.82 7.6l8fc-14 20-45 
BL00215A IS. 82 8.851e- 
11 123-148 BL00215B 
10.44 9.526e-ll 69-82 
BL00215B 10.44 7.300e- 
09 272-285 BL00215B 
10.44 8.500e-09 165- 
178 


394 


BLO 06 74 


AAA-proteirt family 
proteins. 


BL00674B 4.46 2 . 723e- 
16 299-321 


397 


PRO 004 8 


C2H2-TYPB ZINC FINGER 
SIGNATURE 


PR00048A 10.52 8.579e- 
11 141-155 


398 


PR00761 


BINDIN PRECURSOR 
SIGNATURE 


PR00761B 9.93 6.764e- 
09 55-74 


399 


BL00240 


Receptor tyrosine kinase 
class III proteins. 


BL00240B 24.70 7.907e- 
10 118-142 


401 


PF00676 


Dehydrogenase El 
component . 


PF00676B 24.71 8.Q71e- 
18 331-369 PF00676D 
14.40 3.854e-lS 406- 
506 PF00676C 16.88 
9.182e-14 454-478 


402 


BL00514 


Fibrinogen beta and 
gamma chains C- terminal 
domain proteins. 


BL0O514C 17.41 4.673e- 
28 4432-4469 BL00514G 
15.98 6.092e-14 4555- 
4585 BL0O514D 15.35 
2.532e-12 4473-4486 
BL00514F 11.65 4.280e- 

-in A cri Q-A c: rt A RT.nnRlAW 
1U ^ rilj UUOJ.1.1 

14.95 4.955e-10 4584- 
4609 


403 


PF00992 


Troponin. 


PF00992A 16-67 5-974e- 
09 105-140 


404 


PR00019 


LEUCINE- RICH REPEAT 


PR00019B 11.36 1.450e- 
10 73-87 PR00019A 
11.19 8.043e-10 76-90 
PR00019B 11.36 l.OOOe- 
09 50-64 PR00019B 
11.36 1.000e-09 96-110 


405 


BL00232 


Cadherins extracellular 
repeat proteins domain 
proteins. 


BL00232B 32.79 9.557e- 
20 139-187 BL00232B 
32.79 2.246e-18 29-77 
BL00232B 32.79 5.985e- 
18 358-406 BLD0232B 
32.79 5.500ft-16 246- 
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294 BL00232B 32.79 
9.384e-15 463-511 
BL00232C 10.65 2.537e- 
12 244-262 BL00232C 
10.65 4.326e-ll 356- 
374 BL00232C 10.65 
7.26le-ll 461-479 
BL00232C 10.65 7.457e- 
11 27-45 


407 


PP00426 


Outer Caps id protein VP4 
(Hemagglutinin) . 


PF00426S 15.67 5.634e- 
09 902-940 


409 


BL01160 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 9.69Se- 
09 126-180 


410 


BL00741 


Guanine- nucleotide 
dissociation stimulators 
CDC24 family sign. 


BL00741B 14.27 2.731e- 
09 252-275 


411 


PF00646 


F-box domain proteins . 


PF00646A 14.37 6.344e- 
09 86-100 


412 


BLOO603 


Thymidine kinase 
cellular- type proteins. 


BL00603B 11.39 8.500e- 
09 542-557 


415 


BL00866 


Carbamoyl -phosphate 
synthase subdomain 
proteins. 


BL00866B 36.29 3.571e- 
31 245-291 BL00866C 
23.26 9.000e-25 331- 
366 


418 


PR00239 


MOIJ.USCAN RHODOPSIN C- 
TERMINAL TAIL SIGNATURE 


PR00239E 1.58 6.114b- 
09 590-602 


421 


PP00791 


Domain present in ZO-1 
and Unc5-like netrin 
receptors - 


PF00791B 28.49 7.955e- 
14 23-78 PP00791B 
28.49 3.653e-12 273- 
328 PF00791B 28.49 
4.273e-ll 156-211 
PF00791B 28.49 7.81Be- 
11 89-144 PP00791B 
"3R AQ 1 R94f»-10 56-111 

PF00791C 20.98 3.559e- 
09 37-76 PF00791C 
20.98 5.235e-09 170- 
209 PP00791C 20.98 
5.235e-09 381-420 
PF00791B 28.49 6.202e- 
09 189-244 PF00791B 
28.49 7.028e-09 435- 
490 PF00791B 28.49 
8.679e-09 367-422 


424 


DM00892 


3 RETROVIRAL PROTEINASE. 


DM00892C 23.55 7.207e- 
28 1645-1679 


425 


PR00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 
SIGNATURE 


PR00109D 17.04 5.881e- 
10 228-251 


429 


BL00518 


Zinc finger, C3HC4 type 
(RING finger), proteins. 


BL00518 12.23 4.600e- 
11 31-40 


431 


BL00039 


DEAD -box subfamily ATP- 
dependent helicases 
proteins. 


BL00039D 21.67 l.B44e- 
34 490-536 BL00039A 
18.44 5.6l5e~l9 205- 
244 BL00039B 19.19 
8.920e-l6 251-277 
BL00039C 15.63 5.781e- 
15 333-357 


432 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 7.652e- 
12 169-185 


433 


PRO 082 8 


FORMIN SIGNATURE 


PR00828B 5.23 8.218e- 
10 382-405 


436 


BL00415 


Synapsins proteins. 


BL00415N 4.29 B.643e- 
11 195-239 BL00415N 
4.29 3_036e-09 809-853 


443 


PR00834 


HTRA/DEGQ PROTEASE 
FAMILY SIGNATURE 


PR00834F 10.91 6.040e- 
11 221-234 


446 


PPD1140 


Matrix protein (MA) , 


PF01140D 15.54 9.663e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






P15. 


10 183-218 PF01140D 
15.54 3.093e-09 246- 
281 


449 


PR00568 


DOPAMINE D3 RECEPTOR 
SIGNATURE 


PRG056BG 13.95 5,551e- 
09 39-53 


451 




{SCR repeat proteins. 


10 47-59 


452 


BL0073G 


Receptor tyrosine kinase 
class V proteins. 


BL00790I 20.01 2.82le- 
09 618-649 


456 


PR00380 


KINESIN HEAVY CHAIN 
SIGNATURE 


PR00380A 14.18 l.QOOe- 
25 77-99 PR00380D 
9.93 1.000e-21 281-303 

rinnmann 1 i 10 0 iQCn 

PROOJaQC U - IB o.^ooe- 
17 230-249 PR00380B 

1 0 £a a tj/Ia if 1 QA 

212 


457 


PRO 0253 


GAMMA-AMINOBOTYRIC ACID 
(GAB A) RECEPTOR 
SIGNATURE 


PR00253A 9.15 9.l43e- 
24 246-267 PR00253B 
13.4/ 2-l>00e-^3 2 /«£- 
294 PR00253C 13.85 

PR00253D 16.68 5.950e- 

Ol JCO ATk 


467 


PR00849 


GLYCOSYL HYDROLASE 


PR00849D 9.77 9.236e- 


471 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 8.2D0e-l2 
33-44 


472 


BIi0022b 


Intermediate filawents 
proteins . 


09 282-330 


473 


BL00344 


GATA- type zinc finger- 
domain proteins. 


BL00344 17.39 7.000e- 
12 814-852 


474 


BL00481 


Thiol - act lva ted 
cytolysins proteins. 


BL00481E 13.07 8.909e- 
09 173-199 


479 


PR00319 


BETA G-PROTEIN 
{ TRANSDUC IN ) SIGNATURE 


PR00319B 11.47 2.571e- 
09 393-408 


480 


PD01066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 1.90Oe- 
38 8-47 


481 


PR00405 


HIV REV INTERACTING 
PROTEIN SIGNATURE 


PR00405C 19.41 l.OOOe- 
19 451-473 PR00405B 
11.83 4.333e-18 430- 
448 PR0040SA 17.71 
4.971e-18 411-431 


482 


PRQ0049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 9.286e- 
10 959-974 PR00049D 

PR00049D 0.00 1.305e- 
09 937- 9"^? PR00049D 
0.00 8.322e-09 939-954 


486 


pnonnn"7 
rKvuiiu f 


fYlMPTiVMFNT HO DOMAIN 

SIGNATURE 


23 653-673 PR00007A 
19.33 6.192e-22 626- 
653 PR00007C 15.60 
5.846e-19 698-720 
PR00007D 9.64 3.647e- 
13 732-743 


487 


PDO0567 


PROTEIN RNA- BINDING RNA 
REPEAT HYD. 


PD00567B 18.23 2.853e- 
09 200-214 


488 


PR00988 


URIDINE KINASE SIGNATURE 


PR00988A 6.39 4.569e~ 
12 3-21 


489 


PD01066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 4.882e- 
27 30-69 PD01066 
19.43 3.430e-10 71-110 


490 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 7.864e- 
09 663-678 


492 


BL01128 


Shikimate kinase 
proteins . 


BL01128A 18.84 6.464e- 
17 58-92 


497 


PF00429 


ENV polyprotein (coat 


PF00429 31.08 7.l71e- 
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SEQ ID NO: 


NO. 


DESCRIPTION | 


RESULTS* 






polyprotein) . 


15 21-71 


498 


BLO012O 


Lipases, serine 
proteins . 


BL00120B 11.37 7.923e- 
09 18S-20O 


500 


BL00030 


Eukaryotic RNA-binding 
region RNP-1 proteins • 


BL00030A 14.39 7.353e- 
11 299-318 


501 


BL01159 


WW/rsp5/WWP doioain 


BL01159 13.85 8.579e- 
12 131-146 


505 


BL00021 


Kr ingle domain proteins. 


BL00021B 13.33 3.739e- 
17 492-510 


508 


PRO 0120 


H f TRANS PORTING ATPASE 
f PROTON PEJMP} SIGNATURE 


PR00120C 9.90 5.800e- 
19 705-722 


509 


DM014 17^ 


6 kw INDUCING XPMC2 
MUSHROOM SPAC22G7.04. 


DM01417E 20.62 2,938e- 
16 362-395 DM01417D 
11.08 3.800e-13 322- 
338 


510 


PF00534 


Glycosyl transferases 
group 1. 


PF00534B 14.47 6.625e- 
09 346-370 


511 


PF00534 


Glycosyl transferases 
group 1. 


PF00534B 14.47 6.625e- 
09 293-317 


512 


PF00534 


Glycosyl transferases 
group 1. 


PF00534B 14.47 6.625e- 
09 366-390 


513 


PD01841 


PHOSPH0RYLASE KINASE 
ALPHA MUSCL. 


PD01841A 21.71 l.OOOe- 1 
40 110-160 PD01B41B 
14.35 l.OOOe-40 181- 
222 PD01841D 17.87 
l.OOOe-40 243-295 
PD01841F 13.36 l.OGOe- 
40 333-382 PD01841G 
24.26 l.OOOe-40 386- 
440 PD01841L 18.42 
l.OOOe-40 968-1010 
PD01B41I 23.00 4.545e- 
37 762-804 PD01841E 
18.60 3.750e-36 295- 
333 PD01841J 14.94 
6.023e-35 851-888 
PD01841H 21.30 2.909e- 
33 490-527 PD01841K 
14. Bl 7.088e-33 924- 
954 PD01841C 13 . 78 
9.3B6e-23 222-243 

21 1054-1073 PD01841I 

31 OO O HC7i*— 11 54 
591 


514 


PRO 0153 


CYCL0P11ILIN PBPT1DYL- 
pRniYYTn r? 15? -TRANS 
IS0MERASB SIGNATURE 


PR00153C 11.01 7.188e- 
13 95-111 PR00153E 
9:10 4.150e-12 122-138 


515 


BL00740 


MAM domain proteins. 


BL00740A 13.87 7.188e- 
12 410-423 


516 


Ul w vj o J 




DMO0B92C 23.55 6.087e- 
12 1018-1052 


517 


BL0O242 


Integrins alpha chain 
proteins . 


BL00242C 16.36 B.32De- 
09 12-42 


523 


DMO QO 31 


IMMCNQGLOBTTTmIN V REGION • 


DM00031A 16.80 3.750e- 
39 20-68 DM00031B 
15.41 1.000e-25 84-118 


525 


BL00319 


Amyloidogen i c 
glycoprotein 
extracellular domain 
proteins . 


BL00319C 17.12 8.375e- 
10 61-95 


526 


PF007B9 


Domain present in 
ubiqui t in- regulatory 
proteins . 


PP00789B 19.70 3.30Be- 
12 322-343 PF00789C 
20.98 5.269e-09 367- 
392 


528 


BL01162 


Quinone oxidoreductase / 
zeta-crystallin 
proteins . 


BL01162C 22.80 l.SOOe- 
16 120-164 
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SEQ _D NO: 


ACCESSION i 
NO. 


DESCRIPTION 


RESULTS* 


529 


PR00910 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 3.893e- 
09 60-73 


532 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 4.000e- 
17 11-36 BL00215A 
15.82 8.660e-ll 123- 
148 


533 


BL00215 ; 


Mitochondrial energy- 
transfer proteins. 


BL00215A 15.82 4.000e- 
17 11-36 BL00215A 
15.82 8.660e-ll 97-122 


534 


BL0009G 


Thiolases acyl-enacyme 
intermediate proteins. 


BL00098C 21.65 2.800e- 
38 181-227 BL00098B 
32.59 5.345e-38 86-141 
BL00098D 26.30 8.364e- 
35 245-288 BL00098E 
22.12 1.000e-34 314- 
352 BL00098F 10.18 
4.97le-22 365-386 
BL00098A 10.60 6.455e- 
11 38-50 


53S 


PRO 03 70 


FLAVIN- CONTAINING 
MQNOOXYGENASE (FMO) 
SIGNATURE 


PR00370E 11.96 7.429e- 
22 321-340 PR00370D 
16.33 6_143e-21 185- 
204 PR00370P 17.75 
6.559e-21 376-396 
PR00370B 10.91 9.591e- 
21 27-46 PR00370C 
12.72 3.500e-20 140- 
157 PR00370A 3.35 
6.442e-17 4-20 


536 


BL00O28 


Zinc finger, C2H2 type, 
domain proteins. 


BL00028 16.07 7.429e- 
16 285-302 BL00029 
16.07 6.294e-14 341- 
358 BL00028 16.07 
1.346e-ll 369-386 
BLOC028 16.07 1.692e- 
11 397-414 BL00028 
16.07 4.4S2e-ll 453- 
470 BL00028 16.07 
7.23le-ll 425-442 
BL00028 16.07 4.300e- 
10 313-330 


537 


BL00762 


WHkp-trs domain 
proteins . 


BL00762A 23.43 9.419e- 
15 844-881 


538 


BL00762 


WHEP-TRS domain 
proteins . 


BL00762A 23.43 9.419e- 
15 819-856 


539 


BL00762 


WHEP-TRS domain 
proteins . 


BL00762A 23.43 9.419e- 
15 822-859 


540 


PR009G5 


LEUCYL-TRNA SYNTHETASE 
SIGNATURE 


PR0O985A 12.10 9.000e- 
10 357-375 


541 


PD021O2 


SUBUNIT E V-ATPASE 
VACUOLAR ATP SYNTHASE 
HYDROL. 


PD02102A 16.74 l.OOOe- 
40 3-47 PD02102B 
18.28 4.375e-34 57-100 
PD02102D 21.69 1.923e- 
30 179-218 PD02102C 
26.34 B.929e-26 100- 
146 


543 


BL00028 


Zinc finger, C2H2 type, 
doiqain proteins. 


BL00028 16.07 l.OOOe- 

16.07 6.400e-10 193- 
210 BL00028 16.07 
l.OOOe-09 343-360 
BLQ002B 16.07 6.914e- 
09 78-95 


545 


BL00250 


TGP-beta family 
proteins . 


BL002SOA 21.24 B.OOOe- 
31 293-329 BL00250B 
27.37 5.286e-24 354- 
390 


547 


PR00319 


BETA G- PROTEIN 


PR00319B 11.47 2.714e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






(TRANSDUCIN) SIGNATURE 


09 186-201 PR00319A 
15.27 7.344e-Q9 210- 
227 


548 


BI»01204 


NF - kappa - B / Rel J dorsal 
domain proteins . 


BLO1204A 17.74 l.OOOe- 
40 8-56 BL01204D 
16.42 1.000e-40 177- 
221 BL01204E 13.83 

BL01204C 13.93 B.714e- 
22 141-160 BL01204B 
15.41 4.333e-16 102- 
ltd 


549 


PR0O326 


GTP1/OBG GTP- BINDING 


PR00326A 8.75 B.364e- 


551 


PF00632 


KECT- domain (ubiquitin- 
transf erase) . 


PF00632C 20,66 3.302e- 
23 1569-1601 PF00532B 
18.45 3.700e-21 1515- 
1543 


554 


BL00290 


Immunoglobulins and 
major histocompatibility 
complex proteins. 


BL00290B 13.17 1.600e- 
14 187-205 BL00290A 
20.89 2.059e-l4 130- 
153 


557 


DM0O215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.339e- 
09 846-879 


559 


DM0 1111 


4 kw PHOSPHATASE 
TRANSFORMING 61K PDF1 . 


DM01111L 11.93 3.762e- 
09 7-35 


562 


PF006S8 


Poly-adenylate binding 
protein, unique domain 
proteins . 


PF00658C 16.33 9.455e- 
32 118-155 




BL00141 


Eukaryotic and viral 
aspartyl proteases 
proteins . 


BL00141A 12.10 4.150e- 
10 472-488 


566 


PF0OB55 


PWWP domain proteins. 


PF00855 13.75 5.667e- 
15 272-289 


567 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NO". 


PD01066 19.43 4.977e- 
13 22S-26B 


569 


BL00107 


Protein kinases AT?- 
binding region proteins. 


BL00107A 18.39 7.000e- 
19 118-149 BL00107B 
13.31 5.500e-15 183- 
199 


570 


BL0O107 


Protein kinases ATP- 
binding region proteins. 


BL00107A 18.39 7.000e- 
19 118-149 BL00107B 
13.31 5.500e-15 183- 
199 


572 


PRO 0193 


MYOSIN HEAVY CHAIN 
SIGNATURE 


PR00193D 14.36 1.857e- 
34 454-483 PR00193C 
12.60 2.636e-31 223- 
251 PR00193B 11.69 
7.750e-29 171-197 
PR00193A 15.41 2.588e- 
22 115-135 PR00193E 

la a c i. c tin to crtQ 

19.4/ o,5b9e-UJ bUo- 
537 


573 


tin nm Q-i 


Mvr»o t m ritrm/v r«tl* tm 
PIxUoJ.N Hi£&Vi UnAiN 

SIGNATURE 


fK.0UJ.yjL) 14. 3b l.ob/e- 

34 470-499 PR00193C 

1/ . OW £.OJDC- Ji 

267 PR00193B 11.69 
7.750e-29 171-197 
PR00193A 15.41 2.5B8e- 
22 115-135 PR00193E 
19.47 6.559e-19 524- 
5S3 


575 


BL00752 


XPA protein. 


BL00752B 19.17 9.703e- 
10 885-929 


576 


BL00030 


Eukaryotic RNA-binding 
region RNP-1 proteins. 


BL00030A 14.39 7-000e- 
09 276-295 


577 


BL00116 


DNA polymerase family B 


BL00116A 12.81 5.737e- 
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SEQ ID WO: 


ACCESSION 

no. 


DESCRIPTION 


RESULTS* 






proteins . 


13 864-877 BL00116B 
11.82 l.b29e-l^ yoZ- 
96S 


578 


BL00195 1 


Glutaredoxin proteins. 


BLO0195B 15_31 7.X58e- 
09 121-141 


573 


PRO 0019 


LEUCINE -RICH REPEAT 
SIGNATURE 


PR00019B 11.36 9.000e- 
11 217-231 PR00019B 
11.36 1.360e-09 386- 
400 PRO0O19A 11.19 
3.333e-09 389-403 
PR00019B 11.36 8.920e- 
09 363-377 


580 


PRO 0253 


GAMMA- AMINOBUTYRIC ACID 
(GABA) RECEPTOR 
SIGNATURE 


PR00253A 9-X5 2.125e- 
25 275-296 PR00253B 
13.47 7.923e-24 301- 
323 PR00253D 16.68 
5.846e-23 444-465 
PR00253C 13.85 2.241e~ 
20 335-357 


583 


PRO 0343 


SELECT IN SUPERFAMlLY 
COMPLEMENT- BINDING 
REPEAT SIGNATURE 


PR00343C 16.85 2.286e- 
11 1233-1252 PR00343C 
16.85 5.500e-ll 333- 
352 PR00343C 16.85 
S.500e-ll 7B3-802 
PR00343C 16.85 4.246e- 
10 1491-1510 PR00343C 
16.85 8.230e-10 1686- 
1705 


584 


DM01537 


kw SKI2W SKI 2 NUCLEOLAR 
HBLICASE . 


DM01537B 21.63 1.878e- 
37 79-126 DM01537B 
21.63 9.49le-30 916- 
963 DM01537A 15.14 
3.136e-ll 784-804 


586 


PFC0013 


KH domain proteins 
family of RNA binding 
proteins . 


PP00013 5.78 1.450e-09 
124-136 


587 


DM00892 


3 RETROVIRAL PROTEINASE. 


DM00B92C 23.55 4.409e- 
13 262-296 


5B9 


BLOb^B 


LIM domain proteins. 


BL00478B 14.79 1.643e- 
13 261-276 BL00478B 
14.79 7.709e-09 321- 
336 


590 


PP00855 


PWWP domain proteins. 


PP00855 13.75 8.000e- 
15 931-948 


591 


PF00855 


PWWP domain proteins. 


PF0D855 13.75 8-O0Oe- 
15 1062-1079 


593 


PF00628 


PHD- finger. 


PF00628 15.84 3.455e- 
12 424-439 


594 


PR00205 


CADHERIN SIGNATURE 


PR0020SB 11.39 2.241e- 
16 558-576 PR00205A 
14.73 9.308e-13 542- 
558 PR00205C 13.65 
5.304e-12 594-609 
PR00205B 11.39 4.273e- 
10 336-354 


596 


BL00107 


Protein kinases ATP- 
binding region proteins. 


BL00107A 18.39 4.789e- 
18 307-338 


598 




VJLiUJJr K\JXJ2t±VI FirtUUrl 

ENVELOPE PROBABLE U3 . 


10 55-89 


600 


BL00242 


Integrins alpha chain 
proteins . 


BL0D242E 9.03 9.591e- 
27 985-1014 BL00242C 
16.86 4.115e-26 286- 
316 BL00242D 13.57 
4.150e-25 357-382 
BL00242B 8.13 7.353e- 
12 189-199 BL00242D 
13.57 3.45Se-ll 421- 
446 BL00242A 13.80 
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SKQ ID NO: 


ACCESSION 
NO ■ 


DESCRIPTION 


RESULTS* 








5.000e-ll 61-73 
BLO0249D 1 ^ 57 4 9ft6F>— 
10 291-316 


DUX 




f*- PRfYTVTW BETA Wn-40# 
REPEAT SIGNATURE 


PR.Q0320A 16 T4 S 6!Of»- 
09 198-213 




Don n , ~i r i a 


SIGNATURE 


PR0O97BA 1? 43 A *5GQo- 
10 331-348 


603 




PhorJDol esters / 
diacylglycerol binding 


12 170-183 


604 


BL00315 


Dchydrins proteins. 


BL00315A 9.35 1.672e- 
09 424-452 


605 


BL00415 


Synapsins proteins. 


BL00415N 4.29 9.794e- 
10 295-339 


606 


PRO 0926 


MITOCHONDRIAL CARRIER 
PROTEIN SIGNATURE 


PR00926F 17.75 l.OOOe- 
13 335-358 


608 


PF00855 


PWWP domain proteins. 


PF00855 13.75 5.167e- 
15 265-282 


609 


PP00855 


PWWP domain proteins. 


PF00855 13.75 5.l67e- 
15 211-228 


612 


DM01206 


CORONAVIRUS NUCLEOCAPSID . 
PROTEIN. 


DM01206B 10.69 7.411e- 
10 877-897 DM01206B 
10.69 8.027e-10 861- 
881 DM01206B 10.69 
9.137e-10 873-893 
DM01206B 10:69 1.456e- 
09 859-879 BM01206B 
10.69 1.797e-09 879- 
899 DM01206B 10.69 
4.076e-09 865-885 
DM01206B 10.69 7.038e- 
09 898-918 DM01206B 
10.69 7.949e-09 871- 
891 DM01206B 10.69 
8.29le-09 767-787 


615 


PD02699 


PROTEIN DNA-BINDING 
BINDING DNA. 


PD02699A B.91 2.023e- 
28 129-158 PD02699C 
24.84 1.000e-27 317- 
364 PD02699B 18.28 
I.000e-17 158-1B2 


616 


PRO 03 B0 


KINKS IN HEAVY CHAIN 
SIGNATURE 


PR00380A 14.18 4.086e- 
22 288-310 PR0038OD 
9.93 3.721e-17 486-508 
PR00380B 12.64 2.241e- 
16 410-428 PR00380C 
13.18 2.976e-13 436- 
455' 


617 


PR00380 


KINESIN HEAVY CHAIN 
SIGNATURE 


PR00380A 14.18 4.0866- 
22 288-310 PR00380D 

PR00380B 12.64 2.241e- 
16 410-428 PR00380C 
13.18 2.976e-13 436- 
455 


618 


DM01206 


CORONAVIRUS NUCLEOCAPSID 


DM012C6B 10.69 5.143e- 
12 531-551 DM01206B 
10.69 2.603e-10 535- 
555 


621 


PRO 0 700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700B 16.80 3 . I60e- 
21 561-582 


622 


BL00239 


Receptor tyrosine kinase 
class II proteins. 


BL00239F 28.15 3.222e- 
10 647-692 BL00239C 
18.75 8.304e-10 543- 
566 


623 


PR00407 


EUKARYOTIC MOLYBDOPTBRIN 
DOMAIN SIGNATURE 


PR00407K 9.94 8.448e- 
09 326-339 


624 


BL00641 


Respiratory-chain NADH 
dehydrogenase 75 Kd 


BL00641C 21.10 l.OOOe- 
40 157-202 BL00641E 
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SBQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






subunit proteins . 


24.37 1.000e-40 255- 
308 BL00641F 33,12 
1.000e-40 571-623 
BL00641A 17.15 1.81Be- 
37 48-80 BL00641B 
12.62 5.846e-34 113- 
139 BL00641D 13.23 
9.308e-29 216-240 


627 


PR001Q3 


CAMP - DEPENDENT PROTEIN 
KINASE SIGNATURE 


PR001O3B 17.80 2.500e- 
18 367-380 PR00103B 
13.39 2.080e-14 297- 
312 PR00103A 9.59 
2.957e-14 282-297 
PR00103D 10.83 3.077e- 
12 346-358 PR00103C 
15.68 1.000e-ll 334- 
344 PR00103B 13.39 
1.450e-ll 175-190 
PR00103A 9.59 1.720e- 
10 160-175 


630 


PROOOfll 


GLUCOSE/RIBITOI, 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR00081A 10.53 6.2lle- 
16 4-22 


631 


PP00651 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PF00651 15.00 8.500e- 
14 37-50 


632 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 2.233e- 
10 1324-1344 DM01206B 
10.69 4.822e-10 1276- 
1296 DM01206B 10.69 
7.658e-10 1328-1348 
DM01206B 10.69 B.274e- 
10 1280-1300 DM01206B 
10.69 4.532e-09 1320- 
1340 DM01206B 10.69 
7.266e-09 1326-1346 


635 


3L00107 


Protein kinases ATP- 
binding region proteins. 


BL00107A 18.39 7.600e- 
23 145-176 BL00107B 
13.31 2.636e-13 211- 
227 


636 


BL00657 


Pork head domain 
proteins. 


BL00657A 19.39 1.545e- 
30 101-143 BI»00657B 
22.27 7.750e-26 149- 
192 


637 


BLO01O7 


Protein kinases ATP- 
binding region proteins. 


BL00107B 13.31 l.OOOe- 
10 607-623 


643 


BTjO0018 


Kb" -hand calcium- binding 
domain proteins. 


BIjQOOIB 7.41 4.913e-09 
199-212 


647 


PF00628 


PHD- finger . 


PF0062o lb.B4 2.350e- 
13 385-400 PP00628 
15. o4 3.45be-X2 464- 
479 






Hypothetical 
yabO/yceC/sfhB family 
proteins. 


liLiU J. ± Z. -JEt J, J , Z!) 1-UUUG — 

25 332-357 BL01129C 
25.56 8.200e-23 236- 
279 BI*01129B 12.51 
6.118e-13 191-212 


649 


BL01228 


Hypothetical cof family 
proteins . 


BL01228D 17.44 3.908e- 
10 455-480 


65D 


BL00027 


'Homeobox' domain 
proteins. 


BL00027 26.43 6.684e- 
13 771-814 


651 


BL50002 


Src homology 3 {SH3) 
domain proteins profile. 


Bk50002A 14.19 1.750e- 
12 1026-1045 


653 


PR00253 


GAMMA-AMINOBUTYRIC ACID 
(GABA) RECEPTOR 
SIGNATURE 


PR00253A 9.15 4.000e- 
24 253-274 PR00253C 
13.85 8.800e-24 313- 
335 PR00253B 13-47 
3.143e-22 279-301 
PR00253D 16.68 7.652e- 
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c% r%4*x Tn vr/*\ 

SEQ ID NO: 


ACCESSION 


DESCRIPTION 


RESULTS* 




NO. 














654 


PD01719 


PRECURSOR GLYCOPROTEIN 


PD01719A 12.89 4.452e- 






SIGNAL KB. 


11 969-997 PDD1719A 
















156 PD01/19A 12.89 








7.395e-10 1276-1304 








YlT^ft-1 Tt CI A 1 *\ Oft 1 **\ *^ -*\ 

PD01/13A 12.89 1.222e- 








09 1220-1248 


657 


BL00354 


HMG-I and HMG-Y DNA- 


BL00354C 6.61 8.397e- 






binding domain proteins 


09 S63-578 






(Ahoolc) . 




658 


BL00354 


HMG-I and HMG-Y DNA- 


BL00354C 6.61 8.397e~ 






binding domain proteins 


09 580-595 






(Ahoolc) J 




659 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2 . 174e- 








13 539-572 DM00215 








19.43 4.750e-12 549- 








582 DM00215 19.43 








9.824e-H 551-584 








DM0021S 19.43 2.929e- 








10 54B-581 DM00215 








19.43 4.054e-lC 550- 








583 DM00215 19.43 








5.339e-10 552-585 








DM0021S 19.43 7.l07e- 








-•ft ET A a cn 

10 544-5// 


660 


PRO 06 8 8 


XYLOSE ISOM ERASE 


PR00688I 13.78 9.518e- 






oXoNAIUKd 


U5r Z^»k — iJ-ib 


661 


BL00027 


• Homeobox • domain 


BL00027 26.43 5.950e- 






prote ins . 


23 249-232 


662 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 7.158e- 








10 596-610 


663 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 7.158e~ 








10 596-610 


664 


PRO 03 60 


C2 DOMAIN SIGNATURE 


PRO 03 GOB 13.61 7.158©- 








10 596-610 


666 


PR00819 


CBXX/CFQX SUPERFAMILY 


PRO0819B 10.83 8.988e- 






SIGNATURE 


10 /04- /20 


667 


BL50040 


Elongation factor 1 


BL50040C 22.62 2.143e- 






gamma chain profile. 


16 135-178 


668 


PRO 00 19 


LEUCINE- RICH REPEAT 


PR00019B 11.36 1.360e- 






SIGNATURE 


09 139-153 PRO0U19A 








11.19 1.667e-09 94-108 








PR00019B 11.36 4.600e- 








nQ H7.m 
uy ioj x / / 


670 


BL00018 


EF-hand calcium-binding 


BL00018 7.41 3.250e-10 






domain proteins. 


681-694 BL00018 7.41 








6.400e-10 /XV-730 


672 


PD00131 


ATP -BINDING TRANSPORT 


PD00131B 34.97 l.OOOe- 






TRANSMEMBR . 


34 356-410 PD00131C 








19.59 1.346e-26 504- 








542 


673 


PRO 06 67 


RETINAL PIGMENT 


PRC 06 6 7G 15.33 7.557e- 






EPITHELIUM- RETINAL GPCR 


10 106-123 












674 


■■ ... 

PRO 03 20 


G-PROTEIN BETA WD-40 


PR00320A 16.74 4-857e- 






REPEAT SIGNATURE 


13 593-608 PR00320B 








12.19 4.115e-12 635- 








650 PR00320C 13.01 








B.435e-ll 717-732 








PR00320C 13.01 2.800e- 








10 635-650 PR00320C 








13.01 6.400e-10 593- 








608 PR00320B 12.19 








3.250C-09 593-60B 


675 


PR00320 


G-PROTEIN BETA WD-40 


PR00320A 16.74 4.857e- 






REPEAT SIGNATURE 


13 572-587 PRO032OB 








12.19 4.11Se-12 614- 
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SEO ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








629 PR00320C 13.01 
8.435e-ll 696-711 
PR00320C 13.01 2 . 800e- 
10 614-629 PR00320C 
13.01 6.400e-10 572- 
587 PR00320B 12.19 
3.250e-09 572-587 


676 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 9.667e- 
09 249-263 


679 


PF00642 


Zinc finger C-x8-C-x5-C- 
x3-H type (and similar) . 


PF00642 11.59 3.700e- 
16 225-236 PF00642 
11.59 7.900e-12 187- 
198 


680 


PR00308 


TYPE I ANTIFREEZE 
PROTEIN SIGNATURE 


PR00308C 3.83 8.754e- 
10 286-296 


681 


BL00019 


Actinin-type actin- 
binding domain proteins . 


BL00019D 15.33 4.200e- 
19 227-257 


682 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 4.000e- 
09 99-118 


687 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 8.500e- 
10 538-553 


S89 


BLO1024 


Protein phosphatase 2A 
regulatory subunit PR55 
proteins. 


BL01024A 10.26 l.OOOe- 
40 22-69 BL01024B 
8.91 l.OOOe-40 86-127 
BL01024C 7.80 l.OOOe- 
40 146-185 BL01024D 
13.22 l.OOOe-40 185- 
222 BL01024E 11 . 96 
l.OOOe-40 222-266 

■qt r>-| fi'y/t Q A i -w nrtriA 

x5liUXUJT4r y . *kz. l.OOue- 
40 266-317 BL01024G 

349 BL01024H 13.88 
l.OOOe-40 389-442 


691 


BL00027 


'Horaeobox' domain 
proteins. 


DU\J \I \J & / ^1 w - T .» O . VJ f .1-" — 

31 152-195 


692 


BL00211 


ABC transporters family 
nrotp *i t\fi 


BL00211A 12.23 5.050e- 
09 45-57 


693 


BL002I1 


ABC transporters family 
proteins . 


BL00211A 12.23 5.050e- 
09 45-57 


694 


BL00211 


ABC transporters family 
proteins . 


BL00211A 12.23 S.OSOe- 
09 58-70 


696 


BL006 80 


Methionine 

aminopeptidase subfamily 
1 proteins. 


PTflflCfln 14 11 tr ifiA« 

DijUIIDDU _L t . J / r> , JUig 

17 173-195 


697 


BL00741 


Guanine - nucl eot i de 
dissociation stimulators 
CDC24 family sign. 


BL00741B 14.27 3_418e- 
11 242-265 


698 " 


DM01930 


2 kw FINGER SMCX SMCY 
YDR096W. 


DM01930B 15.41 1.367e- 
37 170-215 DM01930F 
14.16 B.232e-28 267- 
303 DM0193 0B 19.86 
9.163e-l0 37-71 


700 


PR00869 


DNA- POLYMERASE FAMILY X 
SIGNATURE 


PR00869A 12.80 1.281e- 
16 245-263 


701 


PRO 004 8 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR0004 8A 10.52 2.174e- 
10 77-91 PR00048A 
10.52 6.870e-10 133- 
147 PR00048A 10.52 
8.826e-10 105-119 
PR00048A 10.52 5.320e- 
09 161-175 


702 


BL00523 


Sulfatases proteins. 


BL00523E 19.27 2.565e- 
25 326-356 BL00523A 
13.36 5.050e-lfi 38-55 
BL00523B 8.64 5.909e- 
15 86-98 BL00523C 
12.64 5.500e-13 137- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


/ 






1.844e-ll 290-302 
BL00523G 9.46 5.500e- 
10 513-523 BL00523F 
10.85 6.351e-09 413- 
424 


703 


PRO 004 8 


C2H2-TYPB ZINC FINGER 
SIGNATURE 


PR00048A 10.52 8.412e- 
12 376-390 PR00048B 
6.02 1.000e-10 334-344 
PR00048B 6.02 1.474©- 
09 364-374 


707 


PD00787 


SYNTHASE BIOSYNTHESIS 
TRANSFERASE . 


PD0fl7fi7A 14 fi4 A 941i=»-, 

14 66-82 


708 


PRO 0761 


BIND IN PRECURSOR 
SIGNATURE 


PR0D761E 14.32 8.500e- 
10 822-841 


712 


DM01354 


kw TRANSCRIPTASE REVERSE 
II ORF2. 


DM01354Y 10.69 4.977e- 
38 425-465 DM01354X 
13.86 7.300e-34 376- 
415 DM01354V 12.97 
4.923e-17 311-358 
DM01354W 12.64 5.596e- 
10 3=>o-3 /6 


713 


BL00O39 


DEAD -box subfamily ATP- 
dependent hel icases 
proteins. 


BL00039D 21.67 7.54Se- 
27 4SO-496 BL00039A 
18.44 2.537e-18 147- 
186 BL00039C 15.63 
2.2l6e-14 2BO-304 
BL00039B 19.19 1.947e- 
1J 194—220 


715 


BL00383 


Tyrosine specific 
protein phosphatases 
proteins . 


BL00383B 10.35 4.981e- 
10 150-161 


717 


PF00777 


Sialyl transferase 
family. 


PF00777C 18.60 4.0356- 
21 106-161 


718 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031A 16.80 3.750e- 
39 20-68 DM00031B 
15.41 2.688e-28 84-118 
DM00031C 12.79 1.300e- 
12 131-142 


719 


BL0024 3 


Integrins beta chain 
cysteine-rich domain 
proteins. 


BL00243B 17.54 l.OOOe- 

40 131-172 BL00243C 

16.42 1.000e-40 172- 

208 BL00243D 24.07 

1.000e-40 222-274 

BL00243F 22.63 l.OOOe- 

40 314-358 BL00243I 

31.77 6.571e-39 607- 

650 BL00243E 16.70 

3.077e-35 274-304 . 

BL00243G 21.38 3.625e- 

34 35B-400 RT»ftn?4^H 
jtx j j d — iuu au\i \j jl x j n 

17.53 5.235e-29 567- 
593 BL00243A 17.61 
3.250e-21 63-84 
BL00243H 17.53 7.167e- 
16 477-503 BL00243H 
17.53 2.304e-ll 524- 
550 BL00243H 17.53 
5.304e-ll 606-632 
BL00243I 31.77 1.380e- 
09 610-653 


720 


PRO 02 17 


43 KD POSTSYNAPTIC 
PROTEIN SIGNATURE 


PR00217C 10.91 B.022e- 
09 20-36 


722 


PRO 07 04 


CALPAIN CYSTEINE 
PROTEASE (C2) FAMILY 
SIGNATURE 


PR00704D 11.05 5.909e- 
34 135-161 PR00704F 
13.61 7.O00e-26 190- 
218 PR00704E 12.55 
8.07le-26 165-189 
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SJEQ ID NO: 


ACCESSION 

NO. 


DESCRIPTION 


RESULTS* 








PR00704B 17.94 2.241e- 
23 75-98 PR00704A 
14.68 4.094e-l9 30-54 
PROD704C 11.88 1.87le- 
18 99-116 


725 


PR00194 


TROPOMYOSIN SIGNATURE 


PR00194A 7.86 7.652e- 
09 169-187 


726 


PR00194 


TROPOMYOSIN SIGNATURE 


PR00194A 7.86 7.652e- 
09 169-187 


727 


PR00320 


G- PROTEIN BETA WD- 40 
REPEAT SIGNATURE 


PR00320C 13.01 2.l25e- 
13 277-292 PR00320A 
16.74 1.310e-ll 277- 
292 PRO032OC 13.01 
4.522e-ll 323-338 
PRO 03 2 OA 16.74 6 . 586e- 
11 323-338 PR0032OB 
12.19 4.3436-10 323- 
338 PR00320B 12.19 
6.914O-10 277-292 


731 


PRO 019 5 


DYNAMIN SIGNATURE 


PR00195A 11.94 8.627e- 
16 288-307 PR00195E 
9.82 3.912e-ll 457-474 


733 


PF00642 


zinc finger c~x8-c-x5-C- 
X3-H type (and similar) . 


PF00642 11.59 9.082e- 
10 787-798 


738 


BL00039 


DEAD -box ^ubfarailv ATP- 
dependent hel leases 
proteins , 


28 26-65 BL00039D 
21.67 2 105e-2O 338- 
384 BL00039C 15.63 
9.100e-13 160-184 
BL00039B 19.19 9.617e- 
11 73-99 


739 


BL01289 


TSC-22 / dip / bun 
family proteins. 


BL01289A 12.18 8.909e- 
31 326-353 BL01289B 
10.45 9.571e-17 353- 
383 


742 


BL01019 


ADP-ribosylation factors 
family proteins. 


BL01019A 13.20 7.078e- 
12 41-81 


743 


BL00965 


Phosphomannose isomerase 
type I proteins. 


BL00965C 23.78 l.OOOe- 
40 256-305 BL00965B 
17.77 1.600e-25 126- 
153 BL00965A 10.57 
6.400e-19 94-113 


74 7 


BL00021 


Kringle domain proteins. 


BL00021D 24.56 4.563e- 
25 231-273 BL00021B 
13.33 5.345e-2l 60-78 


748 


BL00612 


Osteonectin domain 
proteins. 


BL00612B 11.35 2.034e- 
11 93-126 


749 


PR0045O 


RE COVER IN FAMILY 
SIGNATURE 


PR00450C 12.22 6.880e- 
10 135-157 


752 


BL00795 


Involucrin proteins. 


BL00795C 17.06 6.000e- 
11 384-429 BL00795C 
17.06 9.444e-ll 370- 
415 


754 


BL00051 


Riboeomal protein L39e 
proteins. 


BL00051 20.92 1.935e- 
16 4-50 


755 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 7.723e- 
09 171-184 


760 


BL0102O 


SARI family proteins. 


BL01020C 15. 3S 9-020e- 
12 99-150 


762 


3L00046 


Histone H2A proteins. 


BL00046 12. 95" l.OOOe- 
40 33-88 


763 


PD02411 


PROTEIN TRANSCRIPTION 
REGULATION NUCLEAR. 


PD02411 21.89 9.137e- 
10 206-240 


764 


BL00027 


• Horaeobox * domain 
proteins. 


BL00027 26.43 8.800e- 
29 417-460 


767 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 6.063e- 
10 309-324 BL01208B 
15.83 8.031e-10 165- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








180 BL01208B 15.83 
4.162e-09 85-100 


770 


BL00031 


Nuclear hormones 

re cep t or s DNA— buidmg 

region proteins. 


BL00031A 19.55 9.571e- 

22.25 5.500e-27 242- 
274 


772 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 1.450e- 
18 4-26 PR00449E 
13.50 3.520e-14 142- 
165 PR00449C 17.27 
3.032e-13 44-67 
PR00449D 10.79 8.579e- 
13 107-121 PR00449B 

-| a H 1 ACCo.11 AA 


773 


BL00523 


Salfatases proteins. 


BL00523B 19.27 9.333e- 
23 299-JjiU BJj00523A 
13.36 2.200e-13 47-64 
BI;v)05z3d o.o* Z.bU /6- 
13 91-103 BL00523D 

BLC0523C 12.64 4.512e- 
10 141-152 BL00523F 
10.85 5.821e-10 373- 


775 


BL00028 


Zinc finger, C2H2 type, 
domain proteins. 


31*00028 16.07 7.6B6e- 
09 568-585 


776 


BL00028 


Zinc finger, C2H2 type, 
domain proteins , 


BL00028 16.07 7.686e- 
09 621-638 


111 


BL00O28 


Zinc finger, C2H2 type, 
domain proteins. 


BL00028 16.07 7.686e- 
09 595-612 


ne 


BL00030 


EuJcaxyotic RNA-binding 
region RNP-1 proteins. 


BL00030A 14.39 8.4l2e- 
11 322-341 BL00030A 
14.39 7.000e-10 220- 
239 


119 


PRO 0079 


GLUCOSE- 6 -PHOSPHATE 
DEHYDROGENASE SIGNATURE 


PR00079B 12.98 2.929e- 
26 193-222 PR00079E 
16.65 4.150e-23 348- 
375 PR00079C 8.68 
6.351e-l6 246-264 
PR00079D 13.51 7.070e- 
16 264-281 PR00079A 
16.12 &./69C-1J 169— 
183 


781 


BL00215 


Mitochondrial energy 
transfer proteins. 


BLD0215A 15.82 9.25De- 
17 10-35 BL00215A 
15.82 6.000e-16 221- 

Z4b DliUU/ll V»\ i.J.()i! 

7.857e-12 108-133 
BL00215B 10.44 9.526e- 
11 168-181 


783 


PD002B9 


PROTEIN SH3 DOMAIN 
REPEAT PRESYNA. 


PD00289 9.97 6.276e-09 
159-173 


7B5 


BL00690 


DEAH-box subfamily ATP- 
proteins . 


BL00690B 13.38 l.OOOe- 
12 147-165 BLQ0690A 
6.87 5.320e-10 114-124 
BL00690C 7.51 3.189e~ 
09 218-228 


78rf 


PRO 04 49 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449C 17.27 8-SOOe- 
16 50-73 PR00449A 
13.20 5.235e-14 8-30 
PR00449E 13.50 2.853e- 
11 150-173 PR00449D 
10.79 1.545e-09 111- 
125 


788 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 8.767e- 
10 1-21 


790 


BL00915 


Phosphatidylinositol 3- 
and 4 -kinases proteins. 


BL00915C 22.43 9.182e- 
39 725-764 BL00915B 
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SKQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








22.78 5.050e-33 633- 
671 BL0091SD 27.02 
1.529e-21 795-831 
BL00915A 10.09 l.OOOe- 
13 395-407 


791 


PR00208 

/ 


GLIADIN AND LMW GLUTEN IN 
SUPERFAMILY SIGNATURE 


PR00208A 12.59 6.294e- 
10 120-138 PR00208A 
12.59 6.294e-10 121- 
139 PR00208A 12.59 
6.294e-10 122-140 
PR00208A 12.59 6.294e- 
10 123-141 PR002O8A 
12.59 6.294e-10 124- 
142 PR00208A 12.59 
6.294e-10 125-143 
PR00208A 12.59 6.294e- 
10 126-144 PR00208A 
12.59 6.294e-10 127- 
145 PR00208A 12.59 
6.294e-10 128-146 
PR00208A 12.59 6.294e- 
10 129-147 PR00208A 
12.59 7.411e-09 130- 
148 PR00208A 12.59 
7.658e-09 131-149 
PR00208A 12.59 7.904e- 
09 132-150 PR00208A 
12.59 8.274e-09 118- 
136 PR00208A 12.59 
B.274e-09 119-137 


795 


FR00205 


CADHERIH SIGNATURE 


PR00205B 11.39 5.034e- 
16 302-320 PR00205A 
14.73 1.257e-ll 284- 
300 PR00205C 13.65 
1.333e-ll 337-352 


796 


BL00412 


Neuromodulin (GAP -43) 
proteins . 


BL0O412D 16.54 4.000c 
12 196-247 BL00412D 
16.54 5.705e-ll 197- 
248 BL0Q412D 16.54 
7.848e-10 199-250 
BL00412D 16.54 l.B27e- 
09 195-246 BI,00412D 
16.54 1.918e-09 194- 
245 BL00412D 16.54 
2.102e-09 201-252 


797 


BI*00021 


Kr ingle domain proteins. 


BLQ0021B 13.33 6.339e- 
13 40-58 


799 

• 


BL01052 


Calponin family repeat 
proteins . 


BL01052C 18.51 l.OOOe- 
40 87-127 BL01052A 
16.12 l.biiye-J2 J-Jb 
BL01052B 15.31 1.257e- 
25 52-78 BL01052D 

in o<t q 7"H7t=-?«i 174- 
194 


800 


BL0034S 


p53 tumor antigen 
proteins . 


BL00348F 23.19 3 . 714e- 

ft CJ 1 Q"7„ OA ft 


801 


BL00309 


Vertebrate galactoside- 
binding lectin proteins . 


BL00309C 18.65 1.621e- 
09 62-87 


802 


PR00245 


OLFACTORY RKCEPTOR 
SIGNATURE 


PR00245D 10.47 5.224e- 
09 187-199 


804 


PF00774 


Dihydropyridine 
sensitive L-type calcium 
channel (Beta subuni. 


PP00774A 16.47 8.457e- 
10 110-156 


808 


PRO0667 


RETINAL PIGMENT 
EPITHELIUM-RETINAL GPCR 
SIGNATURE 


PR00667C 11.71 9.875e- 
09 12-28 


BIO 


PD02346 


PHOTOSYSTEM II PROTEIN 
PRECURSOR 


PD02346F 12.89 4,340e- 
09 317-354 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






PHOTOSYNTHESIS. 




Bll 


BL00685 


CBP-A/NF-YB aubunit 
proteins . 


BL00685B 14.41 6.779e- 
14 54-95 BL00685A 
11.22 4.798e-13 5-54 


812 


PR00080 


ALCOHOL DEHYDROGENASE 
[SUPBRFAMILY SIGNATURE 


PR0O08OA 9.32 9.419e- 
10 93-105 


813 


BL00357 


| Histone H2B proteins. 


BL00357 7.74 l-988e-17 
22-65 


815 


PD00066 


PROTEIN ZINC-FINGER 
METAL- BIND I . 


PD00066 13.92 7.923e- 
15 158-171 PD00066 
13 92 5 ?00f»-14 4£-c;Q 
PD00066 13.92 7.000e- 
•14 18-31 PD000G6 
13.92 7.0O0e-l3 130- 
143 PD00066 13.92 
7.500e-13 214-227 
PD00066 13.92 9.000e- 
13 102-115 PD00066 
13.92 4.429e-l2 186- 
199 PD00066 13.92 
1.783e-ll 74-87 


816 


BL01195 


Peptidyl-feRNA hydrolase 
proteins . 


BL01195C 20.12 3.34 8e- 
20 100-139 


820 


BLC0520 


Interleukin-10 family 
proteins. 


BL00520A 6.21 6.47le- 
09 1-14 


822 


BL00972 


Ubiquitin carboxyl- 
terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 8.113e- 
09 224-242 


825 


PR00876 


NEMATODE MBTALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 2.268e- 
10 101-115 


829 


PD02855 


FLAVOPROTEIN PROTEIN 
DNA/PANTOTHEN. 


PD02855A 18.37 4.732c- 
28 88-124 PD02855B 

O.JO D.9/oe - tf7 jlj« — ±'±d 


830 


PR00405 


HIV REV INTERACTING 
PROTEIN SIGNATURE 


PR004 05B 11.83 7.000e- 
21 44-62 PR00405C 
19.41 1.000e-13 65-87 
PR00405A 17.71 7.283e- 
13 25-45 


831 


?R00019 


LEUCINE -RICH REPEAT 
SIGNATURE 


PR00019A 11.19 l.OOOe- 
09 47-61 PROO019B 
11.36 1.720e-09 136- 

J — JU rJ\.UUUJL7D i JL * 3D 

3.880e-09 44-58 


832 


PR00011 


TYPE III EGP-LIKE 
SIGNATURE 


16 164-183 PROOOllD 
14.03 6.850e-16 164- 
183 PROOOllA 14.06 
8.364C-14 164-183 
PROOOllC 24.25 5.415e- 
12 231-260 PROOOllD 
14.03 9.852e-ll 212- 
231 


834 


PD0O306 


PROTEIN GLYCOPROTEIN 
PRECURSOR RE. 


PD00306A 10.26 7.000e- 
12 232-246 


835 


PD00306 


PROTEIN GLYCOPROTEIN 
PRECURSOR RE. 


PD00306A 10.26 4.000e- ! 
10 290-304 


836 


PD00306 


PROTEIN GLYCOPROTEIN 
PRECURSOR RE. 


PD00306A 10.26 7.000e- 
12 216-230 


837 


DM00215 


PROLINE- RICH PROTEIN 3. 


DM00215 19.43 3.898e- 
09 78-111 


839 


PD02784 


PROTEIN NUCLEAR 
RIBONUCLEOPROTEIN . 


PD02784B 26.46 8.302e- 
09 73-116 


840 


PRO 0700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR007O0B 16.80 5.091e- 
22 369-390 PR00700D 
12.47 5.765e-21 491- 
510 PR00700C 13.17 
4.750e-14 449-467 
PR00700P 11.18 8.500e- 
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11 538-549 PR00700E 
17.57 3.100e-l0 522- 
538 


841 


PR00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 
SIGNATURE 


PR00109B 12.27 5.404e- 
13 134-153 


844 


PD02785 


PROTEIN RIBOSOMAL 60S 
L22 RNA-BINDING HEP. 


PD02785B 14.43 l.OOOe- 
40 58-112 PD02785A 
15.23 1.915e-28 8-57 


845 


BLC0826 


MARCKS family proteins. 


BL00826C 7.63 6.738e- 
09 203-230 


846 


BL00518 


Zinc finger, C3HC4 type 
(RING finger), proteins. 


BL00518 12.23 4.429e- 
10 15-24 


849 


BL00S18 


Zinc finger, C3HC4 type 
(RING finger) , proteins. 


BL00518 12.23 l.OOOe- 
08 340-349 


850 


PR00308 


TYPE I ANTIFREEZE 
PROTEIN SIGNATURE 


PR00308A 5-90 6.506e- 
09 12-27 


851 


PD02411 


PROTEIN TRANSCRIPTION 
REGULATION NUCLEAR. 


PD02411 21.89 7.000e- 
16 246-280 


852 


BL00420 


Speract receptor repeat 
proteins domain 
proteins . 


BL00420B 22.67 l.OOOe- 
40 723-778 BLO0420B 
22.67 1.321e-38 933- 
988 BL00420B 22.67 
8.457e-28 482-537 
BL00420B 22.67 4.500e~ 
27 587-642 BL00420B 
22.67 9.625e-27 270- 
325 BL00420B 22.67 
4.205e-26 163-218 
BL00420B 22.67 5.731e- 
23 55-110 BL00420B 
22.67 6.464e-20 377- 
432 BL00420B 22.67 
2.8O0e-15 830-885 
BL00420C 11.90 1.900e- 
13 355-366 BL00420C 
11.90 1.900e-12 808- 
819 BL00420C 11.90 
3.550e-12 248-259 
BL00420C 11.90 2.831e- 
11 141-152 BL00420C 
11.90 5.119e-ll 1018- 
1029 BL00420C 11.90 
7.955e-10 567-578 ■ 


853 


BL00420 


Speract receptor repeat 
proteins domain 
proteins. 


BL00420B 22.67 1.000c- 
40 756-811 BL0042DB 
22.67 1.32le-38 966- 
1021 BLOO420B 22.67 
8.457e-28 482-537 
BL00420B 22.67 4.500e- 
27 620-675 BL00420B 
22.67 9.625e-27 270- 
325 BL00420B 22.67 
4.20Se~26 163-218 
BL00420B 22.67 5.73le- 
23 55-110 BL00420B 
22.67 6.464e-20 377- 
432 BL00420B 22.67 
2_800e-15 863-918 
BL00420C 11.90 1.900e- 
13 355-366 BL00420C 
11.90 1.900e-12 841- 
852 BL0042GC 11.90 
3.550e-12 248-259 
BL00420C 11.90 2.83le- 
11 141-152 BL0042OC 
11.90 5.119e-ll 1051- 
1062 BL00420C 11.90 
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NO. 
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7.95Se-10 567-578 


857 


PRO 03 8 8 


3 1 , 5 ' -CYCLIC NUCLEOTIDE 
CLASS II 

PHOSPHODIESTERASE 
SIGNATURE 


PR00388A 10.45 2.778e- 
09 64-83 




BL00030 


Eukaryotic KNA- binding 
region RKTP-1 proteins. 


BLOO030A 14.39 2.929e- 
13 37-56 BL00030B 
7.03 1.900e-ll 167-177 
BL00030A 14.39 2.000e- 
10 128-147 


861 


PRO 098 8 


URIDINE KINASE SIGNATURE 


PR00988A 6.39 4.250e- 
17 23-41 PR00988C 
13.64 6.714e-16 107- 
123 PR00988F 12.23 
7.828e-15 198-212 
PR00988E 8.27 9.769e- 
12 176-188 PR00988D 
5.95 8.250e-ll 163-174 
PR00988B 11.60 4.512e- 
10 60-72 


863 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215B 10.44 8.071e- 
12 41-54 


864 


PR00775 


90 KD HEAT SHOCK PROTEIN 
SIGNATURE 


PR0O775E 8.06 l.OOOe- 
24 198-221 PR00775B 
3.52 1.837e-23 107-130 
PR00775D 8.91 4.484e- 
17 171-189 PRO0775A 
9.90 8.342e-17 86-107 
PR00775C 10.68 9.379e- 
17 153-171 PR00775G 
10.64 6.850e-15 267- 
286 PR00775F 12.76 
6.769e-14 249-267 


866 


DM01688 1 2 POLY-IG RECEPTOR . 

i 


DM01688G 16.45 9.460e- 
09 89-121 


867 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 5.596e- 
29 14-53 


868 


BL012B7 


RNA 3' -terminal 
phosphate cyclase 
proteins . 


BL01287A 17.95 2.68Be- 
26 16-48 


869 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.464e- 
10 304-337 


872 


BL00046 


Hlstone H2A proteins. 


BL00046 12.95 l.OOOe- 
40 30-85 


874 


BL00188 


Biot in- requiring enzymes 
attachment site 
proteins . 


BL00188 30.29 9.036e- 
32 665-711 


876 


BLG0028 


Zinc finger, C2H2 type, 
domain proteins. 


BL00028 16.07 7.686e- 
09 298-315 


877 


PD02102 


SUBUNIT E V-ATPASB 
VACUOLAR ATP SYNTHASE 
HYDROL. 


PD02102A 16.74 4.176e- 
10 97-141 


879 


BL01189 


Ribosomal protein S12e 
proteins. 


BL01189A 14.27 l.OOOe- 
40 35-71 BL01189B 
13.49 1.000e-40 71-125 


882 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.400e- 
25 62-104 BL00284B 
17.99 6.l82e-12 35-56 


B89 


BL00216 


Sugar transport 
proteins . 


BL00216B 27.64 4.375e- 
21 35-85 


896 


PR00391 


PHOS PHATID YLINOS1TOL 
TRANSFER PROTEIN 
SIGNATURE 


PR00391E 12.50 7.785e- 
15 211-231 PR00391B 
8.39 1.000e-13 83-104 
PR00391D 12.21 9-328e- 
13 191-207 PR00391A 
7.83 5.390e-ll 16-36 


897 


PR00327 


ICE NUCLEATION PROTEIN 


PR00327C 6.37 5.247e- 
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NO. 


DESCRIPTION 


RESULTS* 






SIGNATURE 


09 313-328 


898 


BL0O039 


DEAD- box subfamily ATP- 
dependent he 1 leases 
proteins . 


BL00039D 21.67 7.800e- 
26 386-432 BL00039A 
18.44 6.674e-16 113- 
152 BL00039B 19.19 
1.947e-13 153-179 
BL00039C 15.63 9.4 60e~ 
11 236-260 


901 


PD0OD66 


PROTEIN ZINC-FINGER 
METAL- BlNDI . 


PD00066 13.92 8.200e~ 
16 254-267 PD00066 
13.92 8.200e-16 282- 
295 PD00066 13.92 
8.200e-16 310-323 
PD00066 13.92 8.200e- 
16 366-379 PD00066 
13.92 8.200e-16 394- 
407 PD00066 13.92 
8.20Oe-14 338-351 


902 


BL01115 


GTP-binding nuclear 
protein ran proteins. 


BL01115A 10.22 9.321e- 
11 6-50 


903 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 9.l60e- 
09 97-111 


904 


PRO 03 81 


KINESIN LIGHT CHAIN 
SIGNATURE 


PR00381E 8.75 6.586e- 
25 335-356 PR00381B 
18.17 2.667e-24 204- 
224 PR00381A 9.55 
2.800e-24 107-125 
PR00381C 12.48 4 . 522e~ 
24 226-245 PR00381D 
13.94 1.084e-22 291- 
309 PR00381F 9.13 
3.288e-22 370-392 
PR00381F 9.13 7.l81e- 
13 286-30B PR00381B 
8.75 4.066e-ll 251-272 
PR00381E 8.75 7. 03 Se- 
ll 293-314 PR00381E 
8,75 8.364e-10 377-398 
PR00391D 13.94 5.230e- 
09 333-351 PR00381C 
12.48 7.120e-09 310- 
329 


906 


PRO 034 5 


STATHMIN FAMILY 
SIGNATURE 


PR00345C 4.54 8.557e- 
09 525-549 


907 


PRO 0345 


STATHMIN FAMILY 
SIGNATURE 


PR0Q345C 4.b4 «.5b/e- 
09 513-537 


908 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 9.308e-ll 
144-155 


910 


rDU 10 66 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL - 


PDUlUoD 13. 4 3 ^.BUOe- 
30 48-87 


912 


BL01104 


Ribosomal protein Ll3e 
proteins . 


BL01104C 15.14 6.000e- 
09 364-392 


922 


3LO067B 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 3.842e-09 
500-511 


923 


PRO 03 20 


G- PROTEIN BETA WD- 40 
KhrcAl SIGNATURE 


PR00320C 13.01 2.500e- 
no ill lift MftnTJAi 1 

13.01 5.500e-09 187- 
202 


924 


PD02181 


PROTOCHLOROPHYLLIDB 
REDUCTASE PH0T0SYNT. 


PD02181D 12.85 8.609e- 
09 36-64 


926 


BL00019 


Actinin-type actin- 
binding domain proteins. 


BL00019C 14.66 7.453e- 
25 108-144 BL00019B 
13.34 6.510C-11 61-84 
BL00019D 15.33 9-338e- 
11 205-235 BL00019A 
12.56 2.373e-10 34-45 


928 


BL00678 


Trp-Asp (WD) repeat 


BL00678 9.67 9.308e-ll 



222 



WO 01/53312 PCT/US00/34263 



SEQ ID NO: 


ACCESSION 
NO. 
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proteins proteins. 


273-284 BL00678 9.67 
1.600e-10 314-325 
BL00678 9.67 7.600e-10 
360-371 BL00678 9.67 
8.579e-09 206-217 


929 


BL00518 


Zinc finger, C3HC4 type 
(RING finger), proteins. 


BL00518 12.23 1.857e- 
10 137-146 


930 


BL01085 


Ribulose-phosphate 3- 
epiroerase family 
proteins . 


BL01085D 16.55 4.600e- 
24 134-165 BL01085B 
10.15 5.680e-22 30-52 
BL01085E 18.87 8.676e- 
20 172-202 BL01085C 
21.81 2.03Be-14 66-97 


931 


BL01Q85 


Ribulose -phosphate 3- 
epimerase family 
proteins. 


BLC1085D 16.55 4.600e- 
24 152-183 BL01085B 
10.15 5.680e-22 30-52 
BL01085E 16.87 8.676e- 
20 190-220 BL0108SC 
21.81 2.038e-14 66-97 


933 


PDOO301 


PROTEIN REPEAT MUSCLE 
CALCIUM- B I . 


PD00301A 10.24 6.400e- 
09 160-171 


936 


PF00168 


C2 domain proteins. 


PF00168C 27.49 4.000e- 
12 336-362 


93 7 


BL00415 


Synapsins proteins. 


BL00415N 4.29 9.519e- 
10 5-49 


94 0 


PR00862 


PROLYL OLIGOPEPTIDASB 
SERINE PROTEASE (S9A) 
SIGNATURE 


PR00862D 16.17 4.086e- 
09 63-84 


945 


BL01230 


RNA methyl transferase 
trmA family proteins. 


BL01230B 11.62 2.373e- 
09 407-420 


948 


BL00479 


Phorbol esters / 
diacylglycerol binding 
domain proteins. 


BL00479B 12.57 7.429e- 
18 52-68 BL00479A 
19.86 2.200e-13 26-49 


949 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 1.474e~09 
100-111 


954 


PD01311 


PROTEIN OXIDORBDUCTASE 
NAD INTERGENIC RE. 


PD01311A 30.23 5.909e- 
10 66-111 


955 


PF00651 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PF006S1 15.00 3.250e- 
12 47-60 


956 


PF00651 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PF00651 15.00 3.250e- 
12 47-60 


957 


BL00379 


CDP- alcohol 

phosphatidyl trans f erases 
proteins . 


BL00379 24.^4 l.^lOe- 
15 111-148 


959 


BL01115 


GTP- binding nuclear 
protein ran proteins. 


BL01115A 10.22 1.884e- 
10 31-75 


960 


BL01115 


GTP- binding nuclear 
protein ran proteins. 


BL01115A 10.22 3.438e- 
14 110-154 


962 


BL00061 


Short -chain 

dehydrogenase e/ reductase 
s family proteins. 


BL00061B 25.79 6.586e- 
13 198-236 


963 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502A 15.06 8.200e- 
11 210-225 


966 


PR00308 


TYPE I ANTIFREEZE 
PROTEIN SIGNATURE 


PR00308A 5.90 7.035e- 
09 55-70 


967 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 1.2B6e- 
12 104-124 DM01206B 
10.69 5.299e-ll 23-43 
DM01206B 10.69 8.274e- 
10 73-93 DM01206B 
10.69 3.962e-09 108- 
128 DM0120 6B 10.69 
5.671e-09 38-58 


969 


PF01008 


Initiation factor 2 
subunit. 


PF01008B 25.59 4.724e- 
31 417-460 PF01O08C 
12.25 5.333e-l8 506- 
526 PF01008A 20.14 
5.875e-15 369-390 
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970 


BL01277 


Ribonuciease PH 
proteins . 


BL01277C 10.18 7.648e- 
10 112-143 BL01277A 
17.39 9.806e-10 40-78 


975 


BL01159 


WW/rspS/WWP domain 
proteins . 


BL01159 13.85 3.605e- 
12 130-145 BL01159 
13.85 4.122e-10 171- 
186 


977 


PF00791 


Domain present in ZO-1 
and Unc5-like netrin 
receptors . 


PF00791C 20.98 2.235e- 
09 55-94 


978 


BL01167- 


Ribosomal protein LI 7 
proteins . 


BL01167B 20.6^ 8.258e- 
19 88-127 


979 


BL00478 


LIM domain proteins. 


BL00478B 14.79 9.357e- 
13 33-48 BL00478B 
14.79 7.250e-12 98-113 


980 


PR00312 


CALSEQOBSTRIN SIGNATURE 


PR00312E 8.32 3.423e- 
36 169-199 PR00312I 
15.78 5.286e-35 332- 
361 PR00312P 15.06 
5.865e-35 199-229 
PR00312H 13.31 8.313e- 
35 263-291 PR00312J 
13.73 5.688e-34 363- 
392 PR00312D 9.43 
■2.b3be-JJ 128-158 
PR00312C 15.14 8.839e- 
33 92-122 PR00312B 
15.08 8.941e-33 62-92 

cnnnn *>r 1 1 n c cc~j^ 
rnvvji^ij J.J...L.L b.oj/e- 

32 230-258 PR00312A 

11.70 6.9l4e-27 35-59 


981 


TP00992 


Troponin . 


PF00992A 16.67 8.816e- 
09 414-449 


982 


PRO 02 9 9 


ALPHA CRYSTALLIN 
SIGNATURE 


PR00299F 13.20 2.367e- 
09 127-149 


983 


BL01150 


Respiratory- chain NADH 
dehydrogenase 20 Kd 
subunit proteins. 


BL01150B 17.16 l.OOOe- 
40 156-202 BL01150A 
14.10 8.200e-39 100- 
138 


986 


dLiUu /Hi> 


Involuc rin pro t c in s . 


BL00795C 17.06 7.211e- 
14 4-49 BL0O795C 

J. / .Ut> J. - § (oB-11 J. — 4o 

BL0O795C 17.06 3.407e- j 
10 14-59 R^L0JO*7QI^r , 
17.06 7.802e-10 2-47 
BL00795C 17.06 8.640e- 
10 19-64 BL00795C 
17.06 7.400e-09 11-56 
BL00795C 17.06 7.800e- 
09 3-48 


987 


BL00939 


Ribosomal protein Lie 
proteins. 


BL00939F 17.27 5.393e- j 
09 810-840 


988 


PRO 0452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 6.538e- 
11 525-541 


989 


PRO 04 52 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 6.538e- 
11 497-513 


994 


BL00027 


1 Horoeobox ' domain 
proteins . 


BL00027 26.43 2.500e- ! 
25 146-189 


997 


BL01304 


ubiH/COO.6 monooxygenase 
family proteins. 


BL01304A 8.05 3.893e- 
11 65-79 


998 


DM01767 


5 TRANSMITTER DOMAIN. 


DM01767B 10.07 7.868e- 
09 22-39 


1000 


PR00926 


MITOCHONDRIAL CARRIER 
PROTBIN SIGNATURE 


PR00926C 16.07 1.750e- 
24 73-94 PR00926D 
10.53 3.250e-23 126- 
145 PR00926F 17.75 
6.211e-23 217-240 
PR00926B 11.70 6.625e- 
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20 174-193 PR00926B 
16.07 2.125e-18 24-39 
PR00926A 10.41 l.OOCe- 
15 11-25 PR00926F 
17.75 5.565e-09 120- 
143 


1005 


BL00406 


Actins proteins. 


BL00406B 5.47 l.OOOe- 
40 88-143 BL00406C 
6.75 1.000e-40 147-202 
BL00406D 12.58 3.700e- 
40 270-325 BL00406E 
8.44 7.375e-38 327-377 
BL00406A 9.95 3.348e- 
29 11-46 


1006 


BL00406 


Actins proteins. 


BL00406B 5.47 l.OOOe- 
40 88-143 BL00406C 
6.75 1.000e-40 147-202 
BL00406E 8.44 l.OOOe- 
35 248-298 BL00406A 
9.95 3.348e-29 11-46 


1007 


PRO 03 04 


TAILLESS COMPLEX 
\ POLYPEPTIDE 1 
! (CHAPE RONE) SIGNATURE 


PRD0304D 11.04 8.7146- 
22 384-407 PR00304C 
8.69 4.667e-20 98-118 
PR00304B 11.60 7.577e- 
19 68-87 PR00304A 
9.20 3.382e-l6 46-63 
PR00304E 7.79 6.870e- 
13 418-431 


1009 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL 
BINDING NU. 


PD01O66 19.43 2.929e- 
32 9-48 


1011 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 2.929e- 
32 68-107 


1012 


BL0051B 


Zinc finger , C3HC4 type 
(RING finger) , proteins . 


BL00518 12.23 6.143e- 
10 64-73 


1016 


PD01168 


SYNTHETASE LIGASB 
PROTEIN ALANYL. 


PD01168H 12.08 1.000a- 
11 174-194 


1018 


PD00930 


PROTEIN GTPASE DOMAIN 
ACTIVATION . 


PD00930B 33.72 1.39le- 
32 261-302 PD00930A 
25.62 9.550e-22 157- 
183 


1022 


BL00175 | 


Phoaphoglycerate rautase 
family phosphohietidine 
proteins . 


BL00175A 15.42 5.179e- 
12 6-26 BL00175C 
23.75' 8.062e-10 79-111 


1025 


PRO 03 05 


14-3-3 PROTEIN ZETA 
SIGNATURE 


PR00305D 16.34 1.439e- 
10 158-185 


1026 


BL00353 


HMG1/2 proteins. 


BL00353B 11.47 2.436e- 
18 238-288 BL00353C 
14.83 8.844e-ll 288- ] 
335 


1028 


BL00183 


Ubiqultin-conjugating 
enzymes proteins. 


BL001B3 28.97 1.310o- 
33 43-91 


1033 


PP00580 


UvrD/REP nelicase. 


PF00580A 13.37 4.720e- 
09 111-133 


1034 


PR00413 


HALOACID 

DEHALOGENASE/EPOXIDE 
HYDROLASE FAMILY 

oXoBrt 1 UKK 


PR00413E 15.78 3.429e- 
09 154-171 


1037 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD010G6 19.43 9.657e~ 
09 5-44 


1038 


PD01796 


PROTEIN TRANSMEMBRANE 
COBALT ZINC CADMIU. 


PD01796 15.01 4.259e- | 
11 55-82 


1039 


BL00299 


Ubiquitin domain 
proteins . 


BL00299 28.84 9_036e- 
09 17-69 


1040 


PR00970 


ARGININE ADP- 
RIBOSYLTRANSFERASE 


PR00970A 17.73 6.143e- 
20 56-78 PR00970D 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






SIGNATURE 


9.96 2.l54e-l6 154-171 
PR00970F 12.30 l.OOOe- 
16 224-241 PR00970G 

9.97 9.229e-l5 242-258 
PR00970B 16.37 1.290e- 
13 86-105 PRO097OC 
11.05 1.643e-ll 115- 
130 PR00970E 11.23 
9.820e-ll 202-218 


1042 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 2.200e~10 
243-254 


1043 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 6.786e- 
13 114-128 PR00048A 
10.52 1.000e-09 172- 
186 


1045 


BL0061S 


C-type lectin domain 
proteins . 


BL00615A 16.68 1.720e- 
11 218-236 BL0061SB 
12.25 1.857e-10 317- 
331 


1046 


BL01092 


Adenylate cyclases 
class- I proteins. 


BL01092N 13.54 8.924e- 
lO 3-40 


1047 


BL01216 


ATP- citrate lyase / 
succinyl-CoA ligases 
family proteins. 


BL01216D 21.75 4.316e- 
28 314-344 BL01216A 
13.91 1.000e-10 97-112 


1049 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 7.618e- 
12 102-136 


1050 


BL01073 


Ribosomal protein L24e 
proteins . 


BL01073 24.30 l.OOOe- 
40 12-62 


1054 


BL00571 


Amidases proteins. 


BL00571 25.69 5.875e- 
31 160-212 


10S5 


BL00030 


Eukaryotic RNA-binding 
region RNP-1 proteins . 


BLOOQ30A 14.39 5.235e- 
11 98-117 BL00030B 
7.03 4.316e-09 137-147 


1058 


BL00223 


Annexins repeat proteins 
domain proteins. 


BL0Q223C 24.79 8.754e- 
23 262-317 BL00223A 
15.59 9.478e-14 46-80 
BL00223A 15.59 5.557e- 
11 118-152 


1060 


BL00027 


1 Homeobox ' domain 
proteins . 


BL00027 26.43 3.455e- 
35 158-201 


1064 


BL00455 


Putative AMP-binding 
domain proteins. 


BL00455 13.31 6.21le- 
13 280-296 


1065 


PR00019 


LEUCINE -RICH REPEAT 
SIGNATURE 


PR00019A 11.19 2.000e- 
09 115-129 PR00019B 
11.36 3.880e-09 87-101 


1066 


PR00326 


GTP1/OBG GTP- BINDING 
PROTEIN FAMILY SIGNATURE 


PR00326A 8.75 4.600e- 
16 151-172 PR00326C 
9.79 1.290e-14 200-216 
PR00326B 16.74 8.548e- 
14 172-191 PR00326D 
19.09 1.257e-13 217- 
236 


1071 


PD02B70 


RECEPTOR INTERLEUKIN-1 
PRECURSOR. 


PD02870B 18.83 8.51Be- 
11 164-197 


1072 


PF00B56 


SET domain proteins. 


PFO0B56A 26.14 5.976e- 
09 350-387 


1075 


BLQ1009 


Extracellular proteins 
SCp/Tpx- 1 /Ag5/PR- 1/SC7 
proteins. 


BL01009D 14.19 4.300e- 
20 127-148 BL01009A 
13.75 6.586e-13 57-75 
BL01009E 13.50 1.439e- 
11 159-175 


1077 


PR00724 


CARBOXYPEPTIDASE C 
SERINE PROTEASE (S10) 
FAMILY SIGNATURE 


PR00724A 10.91 l.OOOe- 
08 366-379 


1078 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 l.OOOe- 
12 170-195 BL00215A 
15.82 7.529e-10 79-104 


1079 


BL00678 1 


Trp-Asp (WD) repeat 


BL00678 9.67 4.316e-09 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION, 


RESULTS* 






proteins proteins. 


298-309 


1081 . 


BL0O326 


Tropomyosins proteins. 


BL00326A 14.01 7.398e- 
10 23-57 


1094 


BI.00460 


Glutathione peroxidases 
selenocysteine proteins . 


BLOO460A 28.67 3-204e- 
18 57-92 BL0O46OB 
9.73 6.400e-13 100-118 
BL00460D 16.89 9.143e- 
12 162-182 BL0046OC 
14.35 5.500e-09 133- 
156 


1095 


PD02811 


PROTEIN PEPTIDE 
REDUCTASE MG448 PILB 
FIMBRIA TRAM. 


PD02811A 20.67 3.017e- 
22 67-105 PD02B11B 
17.07 2.263e-21 118- 
151 PD02811C 13.25 
5.696e-13 154-167 


1096 


PD02811 


PROTEIN PEPTIDE 
REDUCTASE MG448 PILB 
FIMBRIA TRAN. 


PD02811A 20.67 3.017e- 
22 60-98 PD02811B 
17.07 2.263e-21 111- 
144 PD02811C 13.25 
5.696e-13 147-160 


1097 


BL00479 


Phorbol esters / 
diacylglycerol binding 
domain proteins. 


BL00479B 12.57 6.143e- 
09 200-216 


1105 


PF0O881 


Nitroreductase family. 


PF00881A 27.15 9.229e- 
13 111-147 


1109 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 3.077e- 
10 15-37 PR00449E 
13.50 1.857e-09 185- 
208 PR00449D 10.79 
8.364e-09 131-145 


1115 


PRO 04 05 


HIV REV INTERACTING 
PROTEIN SIGNATURE 


PR00405B 11.83 5.737e- 
20 42-60 PR00405A 
17.71 2.703e-17 23-43 
PR00405C 19.41 6.902e- 
10 63-85 


1116 


BL00355 


HMG14 and HMG17 
proteins . 


BL00355 5.97 2.528e-25 
20-51 


1117 


BL0035S 


HMG14 and HMG17 
proteins . 


BL00355 5.97 2.528e-25 
20-51 


1120 


BL00107 


Protein kinases ATP- 
binding region proteins. 


BL00107B 13.31 4.857e- 
10 290-306 


1123 


PR 00412 


EPOXIDE HYDROLASE 
SIGNATURE 


PR00412F 18.76 9.526e- 
12 301-324 


1125 


PRO 01 86 


HEMERYTHRIN SIGNATURE 


PR001B6A 13.62 2.800e- 
09 87-101 


1129 


BL0017O 


Cyclophilin- type 
pep tidyl -prolyl cis- 
trans isomerase 
signatur. 


BL00170C 18.49 3.077e- 
33 84-129 BL00170B 
20.97 6.838e-25 37-77 
BL00170A 17.08 3.45Se- 
15 10-37 


1131 


BL00636 


Nt-dnaJ domain proteins. 


BL00636A 8.07 5.3 04e- 
15 29-46 BL00636B 
15.11 1.360e-14 59-80 


1132 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 6.211e-09 
29-40 


1133 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 6.211e-09 
29-40 


1136 


BL00990 


Clathrin adaptor 
complexes medium chain 
proteins. 


BL00990C 18.78 4.176e- 
38 235-269 BLO0990A 
21.44 4.316e-36 94-132 
BL00990B 20.15 2.125e- 
27 157-187 BLO0990D 
16.13 5.320e-18 403- 
422 


1137 


PR00314 


CLATHRIN COAT ASSEMBLY 
PROTEIN SIGNATURE 


PR00314B 15.68 B.OOOe- 
34 100-128 PR00314D 
9.66 3.531e-33 233-261 
PR00314C 16.05 8.909e- 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








32 159-188 PR00314A 
14.53 1.281e-22 13-34 


1139 


BL01115 


GTP- binding nuclear 
protein ran proteins. 


BL01115A 10.22 6.364e~ 
13 13-57 


1141 


BL00107 


Protein kinases ATP- 
binding region proteins. 


BL00107A 18.39 4.00Ge- 
19 451-4B2 BL00107B 
13.31 3.077e-12 519- 
535 


1149 


PR00685 


TRANSCRIPTION INITIATION 
FACTOR I IB SIGNATURE 


PR00685A 13.62 4.676e~ 
09 21-42 


1155 


PD01652 


RECEPTOR CELL NK 
GLYCOPROTEIN IMMUNOGLOB . 


PD01652B 8.50 9.396e- 
10 522-574 PD01652B 
8.50 9.463e-10 740-792 


1157 


PD02894 


HYDROLASE N4- PRECURSOR 
PROTEIN SIGNAL BE. 


PD02894A 21.96 7.B73e- 
28 81-127 PD02894B 
13.93 1.188e-27 178- 
211 


1159 


BLO0623 


GMC oxidoreductases 
proteins . 


BL00623E 15.00 3.531e- 
20 391-414 BL00623C 
10.86 4.240e-20 155- 
176 


1161 


PD01937 


DNA PROTEIN POLYMERASE 
KNDONUCLEASE DNA- . 


PD01937A 6.68 3.475e- 
09 330-341 


1162 


PD01937 


DNA PROTEIN POLYMERASE 
ENDONUCLEASE DNA- . 


PD01937A 6.68 3.475e- 
09 221-232 


1163 


PRO 0624 


HI STONE H5 SIGNATURE 


PR00624D 11.94 7.455e- 
10 214-239 PR00624D 
11.94 1.961e-09 312- 
337 


1167 


BL00226 


Intermediate filaments 
proteins . 


BL00226B 23.86 7.384e- 
09 302-350 


1177 


BL01032 


Protein phosphatase 2C 
proteins . 


BL01032G 8.33 1.422e- 
10 34-48 


1178 


PR00320 


G- PROTEIN BETA WD- 40 
REPEAT SIGNATURE 


PR00320A 16.74 1.794e- 
10 205-220 PR00320C 
13.01 7.840e-10 205- 
220 PR00320B 12.19 . 
B.457e-10 35-50 
PR00320A 16.74 7.146e- 
09 35-50 PR00320B 
12.19 9.100e-09 79-94 


1180 


PR00454 


ETS DOMAIN SIGNATURE 


PR00454D 10.89 4.150e- 
19 765-784 


1181 


BL00291 


Prion protein. 


BL00291A 4.49 8.962e- 
11 152-187 


1184 


BL00720 


Guanine -nu cle ot ide 
dissociation stimulators 
CDC25 family sign. 


BL00720B 16.57 4.103e- 
18 1089-1113 


1185 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 4.553e- 
13 204-229 BL00215A 
15.82 1.429e-12 11-36 
BL00215A 15.82 9.809e- 
11 104-129 


1187 


BL00983 


Ly-6 / u-PAR domain 
proteins . 


BL00983C 12.69 2.761e- 
10 77-93 


1188 


BLQG878 


Ora/DAP/Arg 

decarboxylases family 2 
pyridoxal-P attachment 
si. 


BL00878B 10.95 6.000e- 
16 189-204 BL00878C 
17.74 8.435e-15 225- 
245 BL00878F 19.67 
3.625e-13 379-402 
BL00878D 16.56 1.621e- 
09 270-289 


1191 


PD02939 


PROTEIN GLUTATHIONE j 
SYNTHETASE SY. 


PD02939B 10.10 2.723e- 
12 203-220 PD02939C 
20.01 l.OOOe-ll 224- 
252 


1193 


PR00345 


STATHMIN FAMILY 
SIGNATURE 


PR00345B 7.12 2.800e- 
28 72-101 PR00345B 
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NO. 


DESCRIPTION 


RESULTS* 








8.54 7.652e-28 149-174 
PR00345C 4.54 9.100e- 
28 101-125 PR0034SD 
10.97 1.964e-24 125- 
149 PR00345A 13.46 
5.645e-16 43-62 


1194 


PRO 0345 


STATHMIN FAMILY 
SIGNATURE 


PR0034SB 7.12 2.800e- 
28 108-137 PR00345E 
8.54 7.652e-28 185-210 
PR00345C 4.54 9.100e- 
28 137-161 PR00345D 
10.97 1.964e-24 161- 
185 PR00345A 13.46 
5.645e-16 79-98 


1195 


PP00995 


Seel family. 


PF00995B 17.37 1.120e- 
13 224-264 


1196 


BL00932 


Bacterial -type phytoene 
dehydrogenase proteins. 


BL00982A 18.41 6.73Be- 
11 15-47 


1197 


BL01298 


Di hydrodipi col ina te 
reductase proteins. 


BL01298A 13.90 5.959c- 
09 51-73 


1203 


BL00061 


Short -chain 

dehydrogenases /reductase 
s family proteins . 


BL00061B 25.79 l.OOOe- 
14 152-190 


1204 


PR00118 


BETA- LACTAMASE CLASS A 
SIGNATURE 


PR00118F 16.42 9.386e- 
09 213-229 


1206 


BL01183 


ubiE/COQ5 

methyl transferase family 
proteins . 


BL01183B 21.31 1.429e- 
37 184-229 BL01183D 
27.71 8.535e-27 264- 
307 BL01183A 13.25 
3.250e-23 51-73 
BL01183C 10.77 5.295e- 
09 246-258 


1208 


BL00979 


G -protein coupled 
receptors family 3 
proteins . 


BL00979L 20.63 2.485e- 
09 105-146 


1209 


PFC0023 


Ank repeat proteins. 


PF00023A 16.03 4.857e--~ 
11 49-65 P?00023B 
14.20 1.818e-09 45-55 


1212 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 7.750s- 
14 227-241 PR00048A 
10.52 4.316e-ll 199- 
213 


1213 


PRO 04 50 


RECOVER IN FAMILY 
SIGNATURE 


PR00450C 12.22 1.720e- 
10 20-42 PR00450C 
12.22 3.506e-09 56-78 
PR00450D.16.58 6.769e- 
09 44-64 


1216 


BL00412 


Neuromodulin (GAP- 43) 
proteins . 


BL00412D 16.54 5.598e- 
10 179-230 


1219 


PRO04S6 


RIBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456E 3.06 5.348e- 
11 249-264 


1222 


PD00066 


PROTEIN ZINC- FINGER 
METAL BIND I . 


PD00O66 13.92 7.231e- 
15 295-308 PD00066 
13.92 7.231e-15 406- 
419 PD00066 13.92 
2.286e-12 378-391 
PD00066 13.92 7.857e- 
12 434-447 PD00066 
13.92 3.348e-ll 350- 
363 


1223 


BL500S8 


G-protein gamma subunit 
profile . 


BL50058 27.23 l.OOOe- 
40 13-61 


1226" 


BL00412 


Neuromodulin (GAP-43) 
proteins . 


BL00412D 16.54 8.439e- " 
09 279-330 


1227 


BL00437 


Catalase proximal heme- 
ligand proteins. 


BL00437A 18.82 1.00 - 0e :: 
40 49-101 BL00437B 
16.28 1.000e-40 114- 
168 BL00437C 21.86 
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NO. 
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1.000e-40 190-239 
BL00437D 25.72 l.OOOe- 
40 248-301 BL00437E 
23.95 1.000e-40 327- 
379 


1230 


BL01160 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 8.297e"^~~ 
10 5-60 


1231 


PR00735 


GLYCOSYL HYDROLASE 
FAMILY 8 SIGNATURE 


PR00735A 11.19 6.857e- 
09 391-405 


1232 


PRO 04 97 


NEUTROPHIL CYTOSOL 
FACTOR P40 SIGNATURE 


PR00497A 6.92 5.553e- 
10 158-176 


1233 


PRO 04 97 


NEUTROPHIL CYTOSOL 
FACTOR P40 SIGNATURE 


PR00497A 6.92 5.553e- 
10 158-176 


1235 


BL00866 


Carbamoyl -phosphate 
synthase subdomain 
proteins . 


BL00866B 36.29 2.776e- 
09 75-121 


1237 


BL00027 


'Homeobox 1 domain 
proteins . 


BL00027 26.43 1.8l8e~ \ 
21 36-79 


1243 


PRO 04 03 


WW DOMAIN SIGNATURE 


PR00403B 12.19 1.184e- 
11 10-25 


1246 


PD01168 


SYNTHETASE LIGASE 
PROTEIN ALANYL. 


PD01168L 9.47 2.837e- 
10 31-46 PD0116BL 
9.47 4.490e-10 174-189 
PD0116BL 9.47 7.612e- 
10 183-198 


1249 


BLQ001B 


EF-hand calcium-binding 
domain proteins. 


BL00018 7.41 2.800e-10 
183-196 


1254 


BLD01B3 


Ubiquit in- conjugating 
enzymes proteins. 


BL00183 28.97 2.440e- 
36 96-144 


1255 


BL01115 


GTP-binding nuclear 
protein ran proteins. 


BL01115A 10.22 5.670e- 
11 8-52 


1256 


BL00373 


Phosphor ibosylg ly c i namid 
e formyl transferase 
proteins. 


BL00373C 10.35 3.348e- 
12 143-156 


1258 


PRO 0011 


TYPE III EGF-LIKE 
SIGNATURE 


PR00011B 13.08 3.217e- 
10 174-193 


1259 


BL00518 


Zinc finger, C3HC4 type 
CRING finger) , proteins . 


BL00518 12.23 8.286e- 
10 31-40 


1261 


PRO 00 70 


DIHYDROFOLATE REDUCTASE 
SIGNATURE 

\ 


PR00070D 11.63 l.OOOe- 
15 112-127 PR00070C 
13.09 9.500e-15 51-63 
PR00070A 12.92 S.SOOe- 
12 16-27 


1262 


BL00462 


Gamma - 

glutamyltranspeptidase 
proteins. 


BL00462A 20.89 6.438e- 
24 140-183 BL00462B 
17.88 5.500C-20 230- 
267 BL00462C 27.41 
2.023e-ll 292-347 


1263 


BL00038 


Myc-typc, 'helix- loop- 
helix' dimerization 
domain proteins. 


BL00038B 16.97 9.455e- 
11 62-83 


1264 


BL01115 


GTP -binding nuclear 
protein ran proteins. 


BL01115A 10.22 5.670e- 
11 17-61 


1266 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00837C 17.21 2.714e- 
18 165-182 PR00837A 
14.77 4.512e-12 86-105 
PR00837D 11.12 7.577e- 
12 201-215 


1269 


PRO 0449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449C 17.27 9.308e- 
22 40-63 PR00449E 
13. SO 1.000e-16 137- 
160 PR00449D 10.79 
3.520e-ll 102-116 


1270 


BL00276 


Channel forming colic ins 
proteins. 


BL00276A 8.87 l.SOOe- 
09 17-29 


"1275 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327C 15.47 9.769e- 
09 228-243 


1276 


PRO 04 12 


EPOXIDE HYDROLASE 


PR00412B 12.59 7-894e- 
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NO. 
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SIGNATURE 


12 119-135 PR00412C 
11.30 1.857e-ll 165- 
179 PR00412A 13.23 
3.400e-ll 100-119 


1277 


PF00756 


Putative esterase. 


PF00756C 14-12 9.53Be- 
10 127-157 


1279 


BL00134 


Serine proteases, 
trypsin family, 
hi3tidine proteins. 


BL00134A 11.96 9.325e- 
13 128-145 


1280 


BL01220 


Phosphat idyl ethanol amine 
-binding protein family 
proteins . 


BL01220C 14.75 9.348e- 
15 248-276 


1285 


BL00518 


Zinc finger, C3HC4 type 
(RING finger) , proteins . 


BL00518 12.23 2.286e- 
10 33-42 


1287 


PP00791 


Domain present in ZO-1 
and UncS-like netrin 
receptors . 


PF00791B 28.49 7.1B2e- 
11 288-343 


1292 


PROOB02 


SERUM ALBUMIN FAMILY 
SIGNATURE 


PR00BO2B 16.51 1.610e- 
10 81-105 


1297 


PR00716 


M- PHASE INDUCER 
PHOSPHATASE SIGNATURE 


PR00716C 17.65 5.696e- 
09 23-44 


1298 


BL00478 


LIM. domain proteins. 


BL00478B 14.79 6.47Be- 
14 268-283 


1301 


BL00127 


Pancreatic ribonuclease 
family proteins. 


BL00127C 31.49 3.571e- 
28 82-126 BL00127B 
26.57 8.BO0e-28 23-68 


1302 


PR00637 


TYPE 3 BOMBESIN RECEPTOR 
SIGNATURE 


PR00637B 11_27 4.250e- 
09 290-306 


1307 


BL00215 


Mitochondrial energy 
transfer proteins. 


BL00215A 15.82 5.500e- 
17 13-38 BL00215A 
15.82 1.000e-16 226- 
251 BL00215A 15.82 
2.65Be-13 107-132 


1308 


PR00898 


VASOPRESSIN V2 RECEPTOR 
SIGNATURE 


PR00898H 11.34 4.682e~ 
09 552-572 


1309 


PD00301 


PROTEIN REPEAT MUSCLE 
CALCIUM-BI . 


PD00301B 5.49 2.731e- 
09 390-401 


1310 


BLGQ983 


Ly-6 / u-PAR domain 
proteins . 


BL00983C 12.69 9.654e- 
13 73-89 BL00983B 
8.19 3.132e-09 12-22 


1313 


BL00194 


Thioredoxin family 
proteins . 


BL00194 12.16 1.900e- 
11 15-28 


"1314 


BL00594 


Aromatic amino acids 
permeases proteins. 


BL00594A 16.75 8.969e- 
10 53-97 


1316 


BL00134 


Serine proteases, 
trypsin family, 
hiBtidine proteins. 


BL00134A 11.96 9.325e- 
13 128-145 


1320 


BL00783 


Ribosomal protein LI 3 
proteins. 


BL00783C 22.43 6.559e- 
24 87-117 BL00783A 
14.55 l-600e-19 8-33 
BL007B3B 12.76 3.500e- 
12 74-86 


1327 


PFO0514 


Armadillo/beta- catenin- 
like repeat proteins. 


PF00514A 31.30 7.268e- 
11 82-120 


1329 


BL00030 


Eukaryotic RNA-binding 
region RNP-1 proteins. 


BL00030A 14.39 6.294e- 
11 129-148 BL0D030B 
7.03 4.789e-09 168-178 


1331 


PR00497 


NEUTROPHIL CYTOSOL 
FACTOR P40 SIGNATORH 


PR00497A 6.92 7..239e- 
09 25-43 


1332 


PR001G1 


NICKEL- DEPENDENT 
HYDROGENASE/B - type 
CYTOCHROME SIGNATURE 


PR00161C 9.51 4.930e- 
09 317-337 


1333 


PD01066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 6.769e- " 
33 10-49 


133* 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 2.200e- 
09 262-281 


1337 


PK00V00 | PROTEIN TYROSINE 


FR00700D 12.47 2.200e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






PHOS PHATASB S IGNATURK 


09 211-230 


1340 


PR00860 


VBRTEBRATB 

METALLOTHI0NEIN 

SIGNATURE 


PR00860A 5.46 5.034e- 
13 5-18 


1341 


BL00893 


mutT domain proteins. 


BLO0893 18.99 6-750e- 
16 46-71 


1343 


BL01282 


BIR repeat proteins. 


BL012B2B 30.49 5.974e- 
21 383-422 


1344 


DM00099 


4 Jew A55R REDUCTASE 
TERMINAL 

DIHYDROPTERIDINB . 


DMO0O99B 14.73 8.313e- 
09 417-427 


1345 


BL00923 


Aspartate and glutamate 
racemasea proteins . 


BL00923B 11.41 5.935e- 
in lie 1 A c 


1348 


PF00651 


BTB (also knovm as BR- 
C/Ttk) domain proteins. 


PP00651 15.00 7.231e- 
13 44-57 


1350 


PR00193 


MYOSIN HEAVY CHAIN 
SIGNATURE 


PR00193D 14.36 3.571e- 
32 416-445 PR00193C 
12.60 6.318e-3l 179- 
207 PR00193B 11.69 
3,571e-24 133-159 
PR00193B 19.47 9.069e- 
22 470-499 PR00193A 
15.41 1.783e-20 77-97 


1352 


PR00447 


NATURAL RESISTANCE- 
ASSOCIATED MACROPHAGE 
PROTEIN SIGNATURE 


PR00447E 9.73 1.554e- 
15 299-319 PR00447D 
13.54 3.408e-15 200- 
224 PR00447A 12.73 
6.357e-ll 97-124 
PRD0447G 6.69 9.877e- 
10 353-373 


1353 


BLU0303 


S-100/ICaBP type calcium 
binding protein. 


BI.00303A 21.77 6.667e- 
26 45-82 BL00303B 
26.15 1.000e-24 93-130 


1355 


BL00039 


DEAD -box subfamily ATP- 
dependent helicases 
proteins. 


BL00039D 21.67 5.950e- 
29 375-421 BL00039A 
18.44 7.136e-29 99-138 
BL00039C 15.63 4.000e- 
18 225-249 BL00039B 
19.19 3.182e-14 141- 
167 


1357 


PF00615 


Regulator o£ G protein 
signalling domain 
proteins - 


PF0D615B 16.25 2-216e- 
12 84-101 PF00615C 

Iv.UO 0.41Ze**12 162- 

176 


1360 


PDO1066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 9.234e- 
29 10-49 


1361 




MUJNHXoXUrtJS UlKUnUoUfiAb 

PROTEIN HMG17 FAMILY 
SIGNATURE 


18 14-29 PR00925B 
3.73 6.143e-14 29-42 
PRQ0925C 5.57 4.789a- 
12 53-64 PR00925D 
6.56 1.857e-10 76-87 


1362 


BL01272 


Glucokinase regulatory 

nimt't* f t* faro i 1 \r r*>T*nt*A > n c 

M^wLulll A* QUI J. jf ^JLUUClllO • 


BL01272B 19.61 6.870e- 
10 116-ni nuninyc 
11.68 3.314e-25 249- 
274 BL01272A 6.49 
1.23le-18 99-117 


1363 


BL01272 


Glucokinase regulatory 
protein family proteins. 


BL01272B 19.61 6.870e- 1 
30 113-148 BL01272C 
11.68 3.314e-25 226- 
251 BL01272A 6.49 
1.231B-18 76-94 


1364 


DM00179 


W KINASE ALPHA ADHESION 
T-CELL. 


DM00179 13.97 5.304e- 
09 167-177 


1368 


PR00169 


POTASSIUM CHANNEL 
SIGNATURE 


PR00169A 16.77 1.592e- 
09 76-96 


1370 


PR00988 


URIDINE KINASE SIGNATURE 


PR00988A 6.39 1.794e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








10 1-19 


1371 


BL00242 


Integrins alpha chain 
proteins . 


BL00242B 8.13 B.615e- 
09 469-479 


1372 


PR0062S 


DNAJ PROTEIN FAMILY 
SIGNATURE 


PR00625B 13.48 7.353e- 
19 46-67 PR00625A 
12.84 I.39le-16 14-34 


1373 


BL0Q434 


HSF-type DNA-binding 
domain proteins." 


BL00434C 23.85 3.778e- 
09 90-130 


1374 


PRO0962 


LETHAL (2) GIANT LARVAE 
PROTEIN SIGNATURE 


PR00952C 8.00 6.337e- 
09 505-526 


1375 


PD02475 


MUCIN EPITHELIAL TUMOR- 
ASSOCIATE. 


PD02475A 23.18 8.552e- 
10 1111-1150 


1376 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL - 
BINDING NO. 


PD01066 19.43 9.571e- 
32 24-63 


13 80 


BL00194 


Thioredoxin family 
proteins . 


BLC0194 12.16 B.333e- 
12 48-61 


1381 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 1.458e- 
1S 1123-1136 


1383 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 7.600e-10 
243-254 


1384 


BL00676 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 7.600e-10 
271-282 


1385 


BL00303 


S-100/lCaBP type calcium 
binding protein. 


BLO0303B 26.15 6.2Q3e- 
10 95-132 


1386 


BL01160 1 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 S.042e- 
09 1574-1628 


1387 


BL00518 


Zinc finger, C3HC4 type 
(RING finger) , proteins . 


BL00518 12.23 l.OOOe- 
11 52-61 


1389 


PDQ1066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL - 
BINDING NU. 


PD01066 19.43 3.600e- 
30 10-49 


1390 


PDO1066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 3.512e- 
31 32-71 


1392 


PROO308 


TYPE I ANTIFREEZE 
PROTEIN SIGNATURE 


PR0030BC 3.83 9.723e- 
10 127-137 


1393 


PR00380 


KINESIN HEAVY CHAIN 
SIGNATURE 


PR00380A 14. IB 9.625e- 
25 88-110 PR00380D 
9.93 2.406e-20 304-326 
PR00380B 12.64 4.414e- 
16 208-226 PR00380C 
13:10 6.538e-16 243- 
262 


1394 


PD00066 


PROTEIN ZINC- FINGER 
METAL -BINDI. 


PD00066 13.92 3.400e- 
14 462-475 PD00066 
13.92 8.800e-14 348- 
361 PD00066 13.92 
9.571e-12 405-418 - 
PD00066 13.92 6.087e- 
11 490-503 PD00066 
13.92 B.043e-ll 320- 
333 


139B 


PD01066 


PROTEIN ZINC FINGER 
ZINC-FINGER METAL- 
BINDING NU. 


PD01066 19.43 6.786c- 
32 10-49 


1400 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 7. 038e- 
09 270-290 


1406 


PD0093O 


PROTEIN GTPASK DOMAIN 
ACTIVATION. 


PD00930A 25.62 7.324e- 
15 363-389 


1407 


BL00030 


Eukaryotic RNA binding 
region RNp-i proteins. 


BL00030A 14.39 7.500e- 
10 457-476 


1408 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 9.550e- 
11 179-193 PR00019A 
11.19 8.826e-l0 228- 
242 PRO0O19B 11.36 
1.360e-09 199-213 
PR00019B 11.36 4-960e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








09 176-190 


1409 


PR00510 


NBBULIN signature 


PR00510A 9.09 4.150e- 
12 182-202 PR00510B 
12.96 8.767e-12 210- 
230 PR00510F 9.88 
8.172e-10 58-75 
PROOblOD 9.21 2.3B7e- 
09 251-267 


1410 


PD00078 


RBPEAT PROTEIN ANK 
NUCLEAR ANKYR. 


PD00078B 13.14 5.696e- 
09 31-44 


1412 


BL00358 


Ribosoraal protein L5 
proteins. 

\ 


BLD0358B 22.76 l.OOCe- 
40 57-103 BL00358C 
13.75 €.087e-14 122- 
136 BL00358D 14.26 
5.500e~13 143-158 
BL00358A 13.06 1.931e- 
11 33-44 


1414 


BL00282 


Kazal serine protease 
inhibitors family 
proteins . 


BL00282 16.88 7.338e- 
10 511-534 


1415 


BL00023 


Type II fibronectin 
collagen-binding domain 
proteins . 


BL00023 24.31 4.300e- 
29 40-77 


1417 


PR00681 


RIBOSOMAL PROTEIN SI 
SIGNATURE 


PR00681G 12.54 2.149e- 
09 38-60 


1418 


DM00973 


3 fcw RESISTANCE BENOMYL 
YLL028W CYCLOHEXIMIDB. 


DM00973A 21.17 1.462e- 
09 171-208 


1419 


PR00319 


BETA G- PROTEIN 
(TRANSDUCIN) SIGNATDRB 


PR00319B 11.47 1 . 571e- 
09 428-443 


1420 


PD01941 


TRANSMEMBRANE 
COTRANS PORTER SYMP. 


PD01941A 14.81 l.OOOe- 
40 142-196 PD01941B 
15.02 7.049e-30 400- 
447 PD01941E 15.92 
2.475e-20 817-864 
PD01941C 19.96 3.118e- 
19 488-543 PD01941D 
27.18 9.614e-18 641- 
690 PD01941F 28.52 
5.382e-15 1038-1093 


1422 


PRO 02 05 


CADHBRIN SIGNATURE 


PR00205B 11.39 8.043e- 
12 199-217 


1423 


PRO 02 09 


ALPHA/BETA GLIADIN 
FAMILY SIGNATURE 


PR00209B 4.88 6.318e- 
11 1009-1028 


1424 


BL50002 


Src homology 3 (SH3) 
domain proteins profile. 


BL50002A 14.19 8.200e~ 
14 367-386 BLS0002A 
14.19 9.250e-12 298- 
317 BL50002A 14.19 
4.462e-H £.Ua-£.z.f 
BL50002B 15.18 l.OOOe- 
09 244-258 


1425 


krifuObZB 


PHD- finger . 


T~>T?/lfiOn 1 CZ RA 1 (\A f^ck — 
L*JrUUb/iD X.X> . Wit 

12 330-345 


1426 


tj urA n c o o 


PHD- finger . 


yrUUo/O ID • OH. 3 -VHtOG 

12 377-392 


1427 


PR0040S 


HIV REV INTERACTING 
PROTEIN SIGNATURE 


PR00405B 11.83 5.114e- 
16 281-299 PR00405A 

A. f - / X 1 , JUOCJ.* .ZD-* 

282 


1428 


BL00039 


DEAD-box subfamily ATP- 
dependent hel leases 
proteins . 


BL00039D 21.67 5.219e- 
34 147-193 


1429 


PR00320 


G~ PROTEIN BETA WD-40 
REPEAT SIGNATURE 


PR00320C 13.01 8.920e- 
10 577-592 


1430 


PR00378 


INOSITOL PHOSPHATASE 
SIGNATURE 


PR00378D 16.86 7.563e- 
12 295-314 PR00378B 
13.80 8.650e-10 166- 
186 


1431 


PRO 0928 


GRAVES DISEASE CARRIER 


PR00928B 13.53 3.769e- 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 






PROTEIN SIGNATURE 


10 103-124 


1433 


BL01113 


Clq domain proteins. 


BL01113B 18.26 7.049e- 
15 14-50 BL01113C 
13.18 7.000e-12 82-102 


1434 


PR00319 


BETA G- PROTEIN 
(TRANSDUCIN) SIGNATURE 


PR00319B 11.47 7.983e- 
10 135-150 


1436 


BL00030 


Eukaryotic RNA-binding 
region RNP-1 proteins. 


BL00030A 14.39 1.000c- 
12 84-103 


1438 


BL00290 


Immunoglobulins and 
major histocompatibility 
complex proteins. 


BL00290B 13.17 2.500e- 
09 250-268 BL00290A 
20.89 4.000e-09 1B8- 
211 


1440 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 4.960e- 
09 38-52 


1441 


PR008D6 


VINCULIN SIGNATURE 


PR00806B 4.28 4.960e- 
09 88-102 


1444 


BL00422 


Granins proteins. 


BL00422D 19.48 l.OOOe- 
08 114-138 


1445 


PD01B41 


PHOSPHORYLASE KINASE 
ALPHA MUSCL. 


PD01B41A 21.71 l.OOOe- 
40 73-123 PD01841B 
14.35 l.OOOe-40 144- 
185 PD01841D 17.87 
l.OOOe-40 206-258 
PD01841F 13.36 l.OOOe- 
40 296-345 PD01841G 
24.26 l.OOOe-40 349- 
403 PD01841I 23.00 
l.OOOe-40 494-536 
PD01841J 14.94 l.OOOe- 
40 895-932 PD01841L 
18.42 1.000e-40 1083- 
1125 PD01B41B 18.60 
9.719e-38 258-2S6 
PD01841K 14.81 l.OOOe- 
35 1041-1071 PD01841H 
21.30 3.189S-31 435- 
472 PD01841C 13.78 
1.000e-25 185-206 
PD01841M 10.82 1.250c- 
20 1175-1194 


1446 


PF00816 


H-NS his tone -family. 


PF00816B 13.84 8.875e- 
09 190-220 


1447 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 2.080e- 
09 402-416 


1448 


DM00315 


072 RIBONUCLEASE 
INHIBITOR. 


DM00315D 18.40 7.393e- 
09 23-67 


1451 


BLO0O3O 


Eukaryotic RNA-binding 
region RNP-1 proteins. 


BL00030B 7.03 2.800e- 
10 94-104 


1454 


DM01688 


2 POLY-IG RECEPTOR. 


DM01688D 13.44 7.l46e- 
09 3B2-405 


1455 


PF00777 


Sialyl transferase 
family. 


PF00777C 18.60 2.929e- 
22 4-59 


1457 


BL00927 


Trehalase proteins. 


BL00927C 10.83 8.085e- 
09 42-53 


1460 


BLO0545 


Aldose 1-epimeraee 
proteins. 


BL00545C 11. 2B 7.353e- 
17 169-182 BLO0545A 
10.20 2.071e-15 73-89 
BL00545B 13.10 3.942e- 
09 140-153 


1466 


PR00097 


ANTHRANILATB SYNTHASE 
COMPONENT II SIGNATURE 


PR00097C 9.42 9.069e- 
09 233-245 


1472 


BL01129 


Hypothetical 
yabO/yceC/sfhB family 
proteins . 


BL01129E 13.25 5.250c- 
22 170-195 BL01129C 
25.56 9.526e-18 63-106 


1473 


BL00790 


Receptor tyrosine kinase 
class V proteins. 


BL0079OI 20.01 2.821e- 
09 2114-2145 


1475 


PF00686 


Starch binding domain 
proteins. 


PP006B6A 13.45 9.100e- 
09 267-277 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


1477 


PF00566 


Probable rabGAP domain 
pro be ins . 


PF00S66A 12.64 7.333e- 
10 466-476 


147B 


BL00030 


Eukaryotic RNA-biading 
region RNP-1 proteins. 


BL00030B 7.03 9.400e- 
10 43-53 


1479 


DM00406 


GLIADIN . 


DM00406 7.73 8-541e-10 
292-305 


1480 


BL0O290 


Immunoglobulins and 
major histocompatibility 
complex proteins. 


BL00290B 13.17 2.385e- 
15 69-87 BL00290A 
20.89 5.091e-ll 12-35 


1481 


PRO 0150 


PHOS PHOENOliP YRUVATE 
CARBOXYLASE SIGNATURE 


PR00150F 10.45 9.039e- 
09 21-51 


1482 


PF007B0 


Domain found in NI Ki- 
ll ike kinases, mouse 
citron and yeast ROM. 


PF00780I 14.69 4.825e- 
09 107-137 


1483 


BL01160 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 1.153e- 
09 108-162 


1485 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 5.909e- 
25 17-56 


1486 


BL00107 


Protein kinases ATP- 
binding region proteins. 


BL00107B 13.31 1.529e- 
09 34-50 


1488 


BL00039 


DEAD- box subfamily ATP- 
dependent helicases 
proteins . 


BL00039D 21.67 9.586e- 
10 116-162 


1490 


BL00166 


Enoyl-CoA 

hydratase/isomerase 
proteins . 


BL00166D 22.87 2.607e- 
24 190-226 BL00166C 
18.93 5.500e-14 140- 
167 BL00166B 16.92 
9.357e-ll 93-115 


1491 


BL00452 


Guanylate cyclases 
proteins . 


BL00452D 28.59 3.700e- 
31 63-106 BL00452E 
11.92 3.045e-13 115- 
131 


1492 


PR00019 


LEUCINE- RICH REPEAT 
SIGNATURE 


PR00019A 11.19 3.667C- 
09 532-546 


1497 


BL00107 


Protein kinases ATP- 
binding region proteins. 


BL00107B 13.31 l.OOOe- 
11 384-400 BL00107A 
18.39 5.345e-ll 322- 
353 


1500 


PF00876 


Ogre family. 


PF00876E 7.99 1.947e- 
10 107-117 


1502 


BL0O027 


'Homeobcoc' domain 
proteins . 


BL00027 26.43 4.789e- 
24 112-155 


1503 


BL00027 


1 Homeobcoc ' domain 
proteins . 


BL00027 26.43 4.789e- 
24 112-155 


1505 


BL01177 


Anaphylatoxin domain 
proteins . 


BL01177B 20.64 S.BOOe-, 
24 448-475 BL0I177C 
17.39 5.333e-19 402- 
421 BL01177B 13.61 
7.840e-16 155-171 
BL01177D 17.50 1.900e- 
15 427-445 


1506 


BL00972 


Ubiquitin carboxyl- 
terminal hydrolases 
family 2 proteins. 


BLO0972D 22.55 5.5006- 
14 311-336 BL00972A 
11.93 7.429e-14 48-66 
BL00972E 20.72 B.759e- 
10 341-363 


1512 


BL00523 


Sulfatases proteins. 


BL00523B 19.27 4.536e- 
22 76-106 BL00523D 
9.89 1.563e-ll 40-52 
BL00523F 10.85 4.162e- 
09 159-170 BL00523G 
9.46 5.333e-09 256-266 


1516 


BL00914 


Syntaxin / epimorphin 
family proteins. 


BL00914 24.91 7.045e- 
14 168-218 


1518 


BL00600 


Aminotransferases class- 
Ill pyridoxal -phosphate 
attachment si. 


BL00600A 17.98 6.143e- 
19 98-122 BL00C00E 
16.43 1.77le-17 302- 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








331 BL006C0G 12.43 
9»62Se-17 377-396 
BL00600B 19.60 5.091e- 
15 160-186 BL00600C 
16.18 6.04Ce-l2 190- 
206 BL006COF 8.77 \ 
1.000e-ll 343-356 
BLO0600D 8.71 l.OOOe- 
10 281-295 


1523 


PD00930 


PROTEIN GTPASB DOMAIN 
ACTIVATION. 


PDO0930B 33.72 9.600e- 
18 41-82 


1528 


PR00320 


G- PROTEIN BETA WD -40 
REPEAT SIGNATURE 


PRO032OB 12.19 4.774e- 
11 192-207 PR00320B 
12.19 8.839e-ll 272- 
287 PR00320B 12.19 
9.743e-10 106-121 
PRO0320A 16.74 1.878e- 
09 192-207 PR00320A 
16.74 2.317e-09 106- 
121 PR0D320A 16.74 
8.683e-09 272-287 
PR00320C 13.01 8.800e- 
09 106-121 


153 8 


DM01970 


0 kw ZK632.12 YDR313C 
END0SOMAL III. 


DM01970B 0,60 4.508e- 
15 171-1B4 


1539 


PF00781 


Diacylglycerol kinase 
catalytic domain 
proteins (presumed) . 


PF00781D 11.11 7.593e- 
10 103-127 


1540 


PR00965 


OCULAR ALBINISM TYPE 1 
PROTEIN SIGNATURE 


PR00965H 10.73 1.231e- 
29 312-334 PR00965E 
12.93 5.846e-29 172- 
195 PR00965F 5.98 
1.123e-28 209-231 
PR00965C 15.04 l.OOOe- 
27 131-151 PR00965D 
5.84 1.000e-27 150-170 
PR00965G B.52 2.440e- 
27 258-279 PR00965B 
4.80 8.650e-26 88-109 
PR00965A 12.52 l.OOOe- 
25 35-55 PR00965I 
3.91 6.442e-25 385-406 


1541 


BL01013 


Oxys terol - binding 
protein family proteins. 


BL01013D 26.81 9.719e- 
17 163-207 


1543 


PD02699 


PROTEIN DNA- BINDING 
BINDING DNA. 


PD02699C 24.84 l.OOOe- 
40 599-646 PD02699A 
8.91 2.286e-34 219-248 
PD02699B 18.28 6.l43e- 
21 485-509 


1544 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.D0 7.857e- 
10 182-197 PR00049D 
0.00 7.102e-09 67-82 


1547 


BLQ0951 


BR lumen protein 
retaining receptor 
proteins. 


BL00951C 19-35 l.OOOe- 
40 93-142 BLO0951D 
13.94 B.714e-40 142- 
177 BL00951A 15.10 
1.000e-38 2-38 . 
BL00951B 14.23 6.2$0e- 
33 38-69 


1548 


BL00536 


Ubiqui tin-activating 
enzyme proteins. 


BL03536F 13.65 8.920e- 
30 279-318 BLO0S36D 
22.91 5.737e-24 21-65 
BL00536E 16.94 4.696e- 
18 24B-279 


1549 


PR00139 


AS PARAG INASE / GLUT AM INASK 
FAMILY SIGNATURE 


PR00139C 11.72 9.679e- ( 
09 550-569 


1553 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 5.119e- 
09 58-73 
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NO. 
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RESULTS* 


1556 


BL00061 


Short -chain 

dehydrogenaeea/reduc tase 
s family proteins. 


BL00061B 25.79 6.276e- 
13 67-105 


1557 


BL01228 


Hypothetical cof family- 
proteins . 


BL01228D 17.44 8.105e- 
12 107-132 ! 


1558 


BL01228 


Hypothetical cof family 
proteins . 


BL01228D 17.44 8.10Se- 
12 107-132 


15S9 


BL01228 


Hypothetical cof family 
proteins . 


BL01228D 17.44 8.105e- 
12 107-132 


1562 


BL00522 


DNA polymerase family X 
proteins . 


BL00522C 11.90 6.600e~ 
18 412-436 BL00522B 
27.30 1.738e-16 364- 
410 BL00522A 25.52 
6.000e-l6 279-326 
BL00522E 19.63 6.l23e- 
14 502-532 BL00522F 
14.90 2.3B5e-13 551- 
575 


1563 


PF006S1 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PP00651 15.00 1.947e- 
11 46-59 


1564 


BL00299 


Ubiquitin domain 
proteins . 


BL00299 28.84 2.823e- 
10 324-376 


1566 


BL01013 


Oxysterol -binding 
protein family proteins. 


BL01013D 26.81 8.594e- 
17 184-228 BL01013C 
9.97 4.906e-12 14-24 


1567 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 3.400e-10 
378-389 BL00678 9.67 
5.800e-l0 418-429 
BL00678 9.67 8.800e-10 
295-306 


1570 


BL00479 


Phorbol esters / 
diacylglycerol binding 
domain proteins. 


BL00479B 12.57 5.235e- 
17 297-313 BL00479A 
19.86 6.625e-15 271- 
294 BL00479A 19.86 
2.667e-14 147-170 
BL00479B 12.57 6.294e- 
12 173-1B9 


1576 


PROOFS 


OXYTOCIN RECEPTOR 
SIGNATURE 

• 


PR00665G 12.36 4.673e- 
24 364-364 PR00665D 
9.93 1.200e-22 138-155 
PR00665F 11.73 4.000e- 
22 337-354 PR00665C ' 
5.89 1.000e-20 65-80 
PR00665B 5.29 4.337e- 
19 24-39 PR00665E 
5.60 2.929e-15 246-260 
PR00665A S.99 5.622e- 
15 11-25 


1577 


DM00099 


4 kw A55R REDUCTASE 
TERMINAL 

DIHYDROPTERIDINE . 


DM00099B 14.73 9.308e- 
10 127-137 


1579 


BL00524 


Somatomedin B domain 
proteins . 


BL00524A 9.65 6.776e- 
14 52-73 


1580 


PD02894 


HYDROLASE N4- PRECURSOR 
PROTEIN SIGNAL BE. 


PD02894B 13.93 6.959e- 
16 182-215 PD02894A 
21.96 2.125e-10 57-103 


1581 


BL00411 


Kinesin motor domain 
proteins . 


BL00411C 15.04 S.292e- 
12 32-54 BL00411H 
15.66 4.44le-ll 245- 
276 


1582 


PR00604 


CLASS IA AND IB 
CYTOCHROME C SIGNATURE 


PR00604A 11.13 2.440e- 
09 79-87 


1584 


PP00651 


BTB (also known as BR- 
C/Ttk) domain proteins. 


PF00651 15.00 1.000c- 
10 225-238 


1585 


DM01551 


kw OSTEOINDUCTIVE YOPM 
MEMBRANE OUTER. 


DM01551C 14.62 9.455e- 
11 125-145 


1586 


DM01354 


kw TRANSCRIPTASE REVERSE I DM01354S 11.61 7.750e- 
II 0RF2- 1 09 474-495 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


1597 


PRO 00 72 


MALIC ENZYME SIGNATURE 


PRO0O72B 13.77 7.955e- 
33 180-210 PR00072A 
12.75 6.040e-25 120- 
145 PR00072C 11.42 
2.286e-24 216-239 
PR00072D 10.77 3.400e- 
22 276-295 PR00072E 
10.54 1.360e-19 301- 
318 PR00072G 10.45 
5.304e-19 433-450 
PR00072F 8.87 5.935e- 
15 332-349 


1589 


BL00191 


Cytochrome t>5 family, 
heme-binding domain 
proteins - 


BL00191H 15.64 1.537e- 
22 61-113 BL00191K 
17.38 9.027e-12 398- 
442 


1590 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 7.716e- 
13 211-224 DM01970B 
8.60 2-l57e-12 94-107 


1591 


DM00517 


5 kw NUCLEAR 60.7 NUP1 
CHROMOSOME . 


DM00517B 10.96 6.625e- 
16 1175-1193 DM00517A 
8.21 X.000e-ll 1015- 
1026 


1592 


BL00037 

- 


Myb DNA-binding domain 
proteins repeat proteins 
proteins . 


BL00037B 15.92 3-250e- 
27 116-142 BL00037A 
16.68 2.500e-24 83-107 
BL00037A 16.68 3.250e- 
12 31-55 BL00037B 
15.92 3.526e-ll 64-90 
BL00037C 16.86 9.654e- 
10 146-164 


1595 


BL00028 


Zinc finger, C2H2 type, 
domain proteins. 


BL00028 16. 6i 1.514e- 
09 110-127 


1598 


PF00628 


PHD- finger. 


PF00628 15.84 3.250e- 
11 1667-1682 


1599 


PR00014 


FIBR0NECTIN TYPE III 
REPEAT SIGNATURE 


PR00014D 12.04 5.500e- 
09 980-995 


1600 


BL00518 


Zinc finger, C3HC4 type 
(RING finger) , proteins . 


BL00518 12.23 6-571e- 
10 30-39 


1602 


BL00412 


Neuromodulin (GAP-43) 
proteins . 


BL00412D 16.54 5.402e- 
10 136-187 


1605 


PF00651 


BTB (also known as BR- 
C/Ttfc> domain proteins. 


PF00651 15.00 3.571e- 
10 44-57 


1607 


BL00252 


Interferon alpha, beta 
and delta family 
proteins . 


BL0D252A 18.49 6.657e- 
23 20-57 BL00252B 
19.78 9.l25e-16 58-109 


1610 


DM00215 


PROLINE- RICH PROTEIN 3. 


DM00215 19.43 l.OOOe- 
08 61-94 


16-11 


BL00904 


Protein 

prenyltransferases alpha 
subunit repeat proteins 
proteins . 


BL00904C 8.98 7.353e- 
10 91-125 BL00904D 
1.47 6.018e-09 127-168 


1612 


PF00168" 


C2 domain proteins. 


PF00168C 27.49 3.250e- 
09 365-391 


1613 


BL00412 


Neuromodulin (GAP-43) 
proteins . 


BL00412D 16.54 6.051e- 
09 932-983 BL00412D 
16.54 7.153e-09 933- 
984 


1614 


BL00559 


Eukaryotic molybdopterin 
cocidoreductases 
proteins . 


BL00559I 13.63 3.531e- 
25 54-83 BL00559K 
13.17 2.957e-18 197- 
224 BL0C559J 19.63 
6.870e-16 124-176 
BL00559L 13.60 9.000e- 
16 266-284 


1615 


PD01427 


TRANSFERASE 

METH YLTRANS FERASE BI . 


PD01427B 22.45 3.025e- 
22 500-541 PD01427A 
19.94 8.773e-18 439- 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








472 


1616 


BL00115 


Eukaryotic RNA 
polymerase II 
hep tapep tide repeat 
proteins . 


BL00115Z 3.12 7.485e- 
09 152-201 BL00115Z 
3.12 9.6D3e-09 145-194 


1617 


BL00303 


S-100/ICaBP type calcium 
binding pxotein. 


BL0O303B 26.15 7.7S0e- 
32 51-88 BL00303A 
21.77 1.947e-31 4-41 


1618 


BL01254 


Petuin family proteins. 


BL012S4P 10.02 8.754e- 
09 137-147 


1619 


PD01888 


1 PEPTIDE REDUCTASE 
| PROTEIN METHI . 
1 


PD01888B 25.10 l.OOOe- 
40 47-97 PD01888C 
21.56 7.000e-30 125- 
155 PD01888A 12.84 
8.800e-15 7-23 


1621 


PR00239 


MOLLUSCAN RHODOPSIN C- 
TEKWINAL TAIL SIGNATURE 


PR00239E 1.58 3.455e- 
09 692-704 PR00239E 
1.58 4.580e-09 697-709 
PR00239E 1.58 d.5B0e- 
09 702-714 PR00239E 
1.58 5.193e~09 703-715 


1622 


PR00860 


VERTEBRATE 
METALLOTHIONEIN 

SIGNATURE 


PR00860B 7.04 1.900e- 
18 27-41 PR00860C 
9.61 1.474e-14 41-51 
PR00860A 5.46 1.720e- 
14 5-18 


1624 


PRO 07 84 


MITOCHONDRIAL BROWN FAT 
UNCOUPLING PROTEIN 
SIGNATURE 


PR00784D 15.86 8.027e- 
11 77-95 


1626 


BL00325 


Actin-depolymeri zing 
proteins . 


BL00325B 21.66 l.OOOe- 
40 93-139 BL00325A 1 
24.83 6.786e-23 61-93 


1631 


BL00064 


L- lactate dehydrogenase 
proteins . 


BL00064B 23.57 l.OOOe- 
40 82-130 3L00064C 
17.28 l.OOOe-40 137- 
182 BL00064E 27.20 
l.OOOe-40 223-275 
BL00064F 25.14 7.882e- 
36 286-331 BL00064A 
21.16 1.000e-33 22-60 
BL00064D 14.19 6.S00e- 
31 182-212 


1632 


PR00063 


RIBOSOMAL PROTEIN L27 
SIGNATURE 


PR00063B 15.24 9.700e- 
11 59-84 PR00063A 
11.71 1.614e-09 34-59 


1634 


PR00239 


MOLLUSCAN RHODOPSIN C- 
TBRMINAL TAIL SIGNATURE 


PR00239D 0.00 1.105e- 
11 36-49 PR00239C 
3.51 2.538e-09 37-45 


1636 


BL01210 


Caveolins proteins. 


BL01210B 13.92 9.531e- 
10 133-183 


1637 


BL00982 


Bacterial -type phytoene 
dehydrogenase proteins. 


BL00982A 18.41 5.388e- 
11 11-43 


1639 


BL01183 


ubiE/COQ5 

methyl transferase family 
proteins. 


BL01183B 21.31 8.144e- 
12 132-177 


1640 


PR00015 


GRAM -POSITIVE COCCUS 
SURFACE PROTEIN ANCHOR 
SIGNATURE 


PR00015B 9.84 8.468e- 
10 128-149 


1641 


PR00320 


G- PROTEIN BETA WD- 40 
REPEAT SIGNATURE 


PR00320B 12.19 5.935e- 
11 364-379 PR00320A 
16.74 7.828e-ll 364- 
379 PR00320C 13-01 
2.800e-10 279-294 
PR00320C 13.01 2.800e- 
10 364-379 PR00320B 
12.19 5.114e-10 279- 
294 PR00320A 16.74 
1.659e-09 279-294 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








PR00320A 16.74 2.098e- 
09 229-244 


1642 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 6.464e 7 
09 114-130 


1643 


PR00169 


POTASSIUM CHANNEL 
SIGNATURE 


PR00169A 16.77 1.806e- 
11 74-94 


1644 


BL00678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 2.200e-10 
109-120 BL00S78 9.67 
5.737e-09 528-539 


1645 


BL01108 


Ribosoraal protein L24 
proteins . 


BL01108A 20.33 7.366e- 
17 56-89 


16T6 


PRO 03 80 


KINESIN HEAVY CHAIN 
SIGNATURE 


PR00380A 14.18 9.270e- 
21 103-125 PR003B0D 
9.93 6. 308e-18 386-408 
PR00380C 13.18 7.923e- 
16 332-351 PR0038OB 
12.64 6.657e-15 292- 
310 


1647 


DM01242 


3 THREONINE— TRNA 
I»IGASE. 


DM01242C 17.15 9.79le- 
37 340-381 DM01242E 
23.00 S.071e-31 463- 
505 DM01242D 23.29 
3.925e-30 420-463 
DM01242B 23.57 8.054e- 
18 265-314 DM01242F 
10.61 7.6l8e-14 526- 
540 


1649 


PD00126 


PROTEIN REPEAT DOMAIN 
TPR NUCTiKA. 


PD00126A 22.53 5.500e- 
10 13-34 


1651 


BI1OII6O 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 6.720e- 
11 431-485 


1652 


"BL00933 


FGGY family of 
carbohydrate kinases 
proteins . 


BL00933A 17.50 4.673e- 
12 11-35 BL00933E 
13.80 9.217e-09 456- 
472 


1653 


BL00795 


Involucrin proteins. 


BL007S5C 17.06 2.988e- 
10 70-115 


1654 


BL00982 


Bacterial- typeCphytoene 
dehydrogenase proteins. 


BL00982A 18.41 7.750e- 
17 302-334 


1655 


BL009B2 


Bacterial -type phytoene 
dehydrogenase proteins. 


BL00982A 18.41 7.750e- 
17 282-314 


1656 


BL00741 


Guanine - nucl eot ide 
dissociation stimulators 
CDC24 family sign. 


BIi00741B 14.27 1.391e- 
16 607-630 


1657 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 7. 93 Be- 
ll 114-136 


1658 


PR00910 


LUTEOVIRUS 0RF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 8.889e- 
10 442-455 


1659 


BL00372 


Ubiquitin carboxyl- 
terminal hydrolases 
family 2 proteins. 


BL00972D 22.55 4.140e- 
12 376-401 BL00972E 
20.72 5.629e-09 446- 
468 


1660 


BLQ0406 


Ac tins proteins. 


BL00406D 12.58 8.767e- 
15 188-243 


1*61 


PR00105 


CYTOSINB-SPECIFIC DNA 

METHYLTRANSFERASE 

SIGNATURE 


PR00105A 10.36 4.900e- 
13 1140-1157 PR00105B 
12.32 2.800e-12 1259- 
1274 PR00105C 10.86 
1.000e-10 1305-1319 


1662 


BCD 0280 


Pancreatic trypsin 
inhibitor (Kunitz) 
family proteins. 


BL00280 24.61 3-172e- 
33 3119-3163 


1663 


PR00319 


BETA G-PROTBIN 
(TRANSDUCIN) SIGNATURE 


PR00319D 11.64 6.625e- 
23 107-125 PR00319C 
13.41 5.714e-20 89-105 
PR00319A 15.27 5.286e- 
19 51-68 PR00319B 
11.47 8.2O0e-19 70-85 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


1664 


BL0OO18 


EF-hand r*R 1 c i \im— Yi i nci i rici 
domain proteins - 


BL0001B 7 41 S OSOr-IO 
489-502 


1667 


PD01066 


PROTEIN ZINC FINGBR 
ZINC- FINGER METAL - 
BINDING NU. 


PD01066 19.43 8.500e- 
38 7-46 


1669 


BL01153 


N0Ll/N0P2/sun family 
proteins . 


BL01153D 19.69 1.188e- 
17 115-141 BL01153C 
13.67 8.977e-15 66-80 
BL01153B 20.52 l.S85e- 
10 13-37 


1671 


PRO 06 78 


PI3 KINASE P85 
REGULATORY SUBUNIT 
SIGNATURE 


PR00678H 9.13 3.100e- 
10 1146-1169 


1672 


BhQ059& 


Chromo domain proteins. 


BL00598 14.45 8.500e- 
20 27-49 


1673 


PR00326 


GTPl/OBG GTP- BINDING 
PROTEIN FAMILY SIGNATURE 


PR00326A 8.75 8.329e- 
09 686-707 


1674 


PR0O049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 7.580e- 
11 343-358 PRD0049D 
0.00 1.2B6e-10 342-357 


1676 


PRO 0747 


GLYCOSYL HYDROLASE 
FAMILY 47 SIGNATURE 


PR00747H 12.76 8.636e- 
19 427-448 PR00747G 
14.50 2.286e-18 368- 
393 PR00747C 12.06 
7.500e-18 112-131 
PR00747A 14.05 4.600e- 
17 42-63 PR00747D 
15.23 8.759e-17 163- 
183 PR00747E 15.13 
8.244e-15 254-272 ! 
PR00747B 7.65 5.355e- 
13 75-90 PR00747F 
13.56 8.714e-10 311- 
328 


1677 


PR00747 


GLYCOSYL HYDROLASE 
FAMILY 47 SIGNATURE 


PR00747H 12.76 8.636e- 
19 309-330 PR00747G 
14.50 2.286e-18 250- 
275 PR00747C 12.06 
7.500e-18 112-131 
PR00747A 14.05 4.600e- 
±1 ilZ-DJ JrKUU J*h its 
7.65 5.355e-13 75-90 
PR00747P 13 56 fi 7l4e~ 

rivVV / ± f IT -L-> - Oaf ItC 

10 193-210 


1680 


BLQ0678 


Trp-Asp (WD) repeat 
proteins proteins. 


BL00678 9.67 4.600e-10 
406-417 BL0067B 9.67 
6.684e-09 320-331 


1681 


BL00678 


Trp-Asp (WD) repeat 
protein© proteins. 


BL00678 9.67 4.6COe-10 
329-340 BL00678 9.67 
6.684e-09 243-254 


1683 


PR00326 


GTPl/OBG GTP -BINDING 
PROTEIN FAMILY SIGNATURE 


PR00326A 8.75 1.346e- 
13 389-410 


1685 


PRO 064 6 


RDC1 ORPHAN RECEPTOR 
SIGNATURE 


PR00646H 6.32 4.1886- 
09 755-771 


1690 


BL01160 


Kinesin light chain 
repeat proteins. 


BL01160B 19-54 6.644e- 
09 75-129 


1691 


PR00456 


RlBOSOMAL PROTEIN P2 
SIGNATURE 


PR0Q456R "\ 06 7 ?flle- 
10 418-433 PR00456E 
3.06 7.281e-10 419-434 
PR00456B 3.06 8.125e- 
10 420-435 


1692 


PR0045* 


RlBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456E 3.06 7.281e- 
10 487-502 PR004S6B 
3.06 7.281e-10 488-503 
PR00456E 3.06 8.125e- 
10 489-504 


1693 


BL00674 


AAA- protein family 
proteins . 


BLD0674C 22.60 8.043e- 
24 274-317 BL00674B 
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SEQ IB NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 








4.46 4.000e-23 241-263 
BL00674D 23.41 8.560e- 
18 338-385 BL00674E 
15.24 1.720e-15 414- 
434 


1697 


PR00409 


PHTHALATE DIOXYGENASE 
REDUCTASE FAMILY 
SIGNATURE 


PR00409F 12.70 4.388e- 
10 427-447 


1698 


PR00466 


CYTOCHROME B-245 HEAVY 
CHAIN SIGNATURE 


PR00466C 10.17 3.443e- 
13 187-208 PR00466B 
5.03 5.500e-ll 162-186 
PR00466F 9.16 6.l59e- 
09 498-517 


1699 


BL00028 


Zinc finger, C2H2 type, 
domain proteins. 


BL0OQ28 16.07 9.217e~ 
12 283-300 BL00028 
16.07 3.769e-ll 255- 
272 BL00028 16.07 
S.154e-ll 171-188 
BL00028 16.07 5.500e- 
11 227-244 BL00028 
16. 07 1.6006-10 199- 
216 


1700 


BL01019 


ADP-ribosylation factors 
family proteins. 


BL01019A 13.20 3.348e- 
15 62-102 BL01019B 
19.49 4.000e-15 107- 
162 


1703 


PD01066 


PROTEIN ZINC FINGER 
ZINC- FINGER METAL- 
BINDING NU. 


PD01066 19.43 2.484e- 
12 200-239 


1707 


PK00109 


TYROSINE KINASE 
CATALYTIC DOMAIN 
SIGNATURE 


PR00109B 12.27 4.558e- 
14 134-153 


1710 


PRO 001 9 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00O19A 11.19 2.565e- 
10 116-130 PR00019B 
11.36 4.600e-09 113- 
127 PR00019B 11.36 
7.120e-09 204-218 


1711 


BL01159 


WW/r S p5/WWP domain 
proteins . 


BL01159 13.85 6.523e- 
11 232-247 BL01159 
13.85 5.408e-10 613- 
628 


1712 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 7-O00e- 
10 187-203 


1713 


PPO0642 


Zinc finger C-x8-C-x5-C- 
x3-H type (and similar) . 


PF00642 11.59 9.550e- 
11 230-241 


1714 


PF00642 


Zinc finger C~x8-C-x5-C- 
X3-H type (and similar) . 


PF00642 11.59 9.550e- 
11 230-241 


1715 


BL01115 


GTP- binding nuclear 
protein ran proteins. 


BL01115A 10.22 7.129e~ 
09 7-51 


1718 


BL00353 


HMG1/2 proteins. 


BL00353C 14.83 6.01Be- 
10 136-183 BL00353B 
11.47 8.866e-09 86-136 


1719 


BIi00412 


Neuromodulin (GAP-43) 
proteins. 


BL00412D 16.54 5.40Be- 
09 432-483 


1721 


BLD0038 


Myc-type, 'helix- loop- 
helix 1 dimerization 
domain proteins. 


BL00038B 16.97 8.448e- 
12 79-100 BL00038A 
13.61 4.000e-ll 52-68 


1723 


PD00567 


PROTEIN RNA- BINDING RNA 
REPEAT HYD. 


PD00567C 9.17 B.500e- 
09 418-428 


1724 


BL01279 


Protein -L~ 
isoaspartate {D~ 
aepartate) 0- 
methyl transferase signa. 


BL01279A 24.27 5.663e- 
12 233-281 


1728 


BL00018 


EF-hand calciun -binding 
domain proteins. 


BL00018 7.41 2.059e-ll 
73-86 " BL00018 7.41 
4.176B-11 157-170 


1730 


RL00594 


Aromatic amino acids 
permeases proteins. 


BL00594A 16.75 1.0B9e- 
09 17-61 
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ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


1731 


BL01160 


Kinesin light chain 
repeat proteins . 


BL01160B 19.54 9-676e- 


1732 


BL01160 


Kinesin light chain 
repeat proteins. 


BL01160B 19.54 9.676e- 
10 316-370 


1 OO 1 


Fr UOfcOU 


Hi stone deacetylase 
family . 


Di?finQcnc' i r: -in a liiao. 
rrUOobub 1-i . *U »i»J4ire- 

22 246-279 PF00850D 

ia nc c Qcn Q on i *7*7 
j.**./o o.orsue-^u x//- 

201 PF00850E 8.88 

8.69le-18 209-235 

PF00850G 22.75 4.098e- 

14 281-323 


1734 


BLQ0354 


HML»~I and HMt»-X UNA- 

binding domain proteins 
(Ahook) . 


BLiUU3b4C. b.ol 5.932e- 
09 292-307 


1735 


DM00179 


w KINASE ALPHA ADHESION 
T-CELL. 


DM00179 13.97 5.263e- 
10 492-502 


1743 


PRO 04 4 9 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 1.188e- 
11 5-27 PR00449D 
10.79 2.241e-10 109- 
123 PR00449E 13.50 
9.289e-10 144-167 


1744 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 1.188e- . 
11 5-27 PR00449D 
10.79 2.241e-10 109- 
123 PR00449E 13.50 
9.289e-10 144-167 


1745 


BL00720 


Guanine- nucleotide 
dissociation stimulators 
CDC25 family sign. 


BL00720B 16.57 8.297e- 
15 136-160 


1746 


PR00081 


GLUCOSE/RIBITOL 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR0OOB1B 10.38 6.727e- 
11 45-57 PR00081E 
17.54 3.935e-10 150- 
168 


1747 


BL00439 


Acyl transferases 
ChoActase / COT / CPT 
family proteins. 


BL00439H 18.24 8.435e- 
14 6S-91 BL00439G 
13.40 2.895e-12 3-14 


1749 


PRO 0819 


CBXX/CFQX SUPERFAMILY 
SIGNATURE 


PR00819B 10.83 7.lS8e- 
11 4-20 


1751 


PD00066 


PROTEIN ZINC- FINGER 
METAL- BIND I . 


PD00066 13.92 3.400e- 
14 33-46 PD00066 
13.92 1.000e-13 89-102 
PD00066 13.92 7.0D0e- 
13 61-74 PD00066 
13.92 6.571e-12 117- 
130 


1753 


BL01013 


Oxysterol -binding 
protein family proteins. 


BL»01UX3D 26.81 b.blbe- 
18 33-77 


1754 


TJT nnnnn 


Receptor tyrosine Jcinase 
class V proteins. 


tjt nmonT on fit o loio^ 
09 490-521 BL0O79OI 

Oft O R?1#»-nQ 60-91 

BL00790I 20.01 6.357e- 
09 287-318 


1756 


PD01066 


PROTEIN ZINC FINGER 
ZINC -FINGER M3TAL- 
BINDING NU. 


PD01066 19.43 9.750e- 
35 10-49 


1758 


DM00406 


GLIADIN. 


DM00406 7.73 7.600e-09 
653-666 


1762 


PD02929 


ADHESION GLYCOPROTEIN 
PRECURSOR I. 


PD02929A 28.27 4.529e- 
09 224-278 


1765 


PR00326 


GTP1/OBG GTP -BINDING 
PROTEIN FAMILY SIGNATURE 


PR00326A 8.75 5.950e- 
11 146-167 


1775 


PFO0023 


Ank repeat proteins. 


PF00023A 16.03 3,077e- 
14 523-539 


1776 


BL00942 


glpT family of 
transporters proteins. 


BL00942F 15.07 4.343e- 
10 371-389 BL00942B 
20.36 8.040e-09 94-137 


1777 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2.373e- 
09 279-312 
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SEQ ID NO: 


ACCESSION 
NO. 


DESCRIPTION 


RESULTS* 


1778 


BL00084 


Copper type II, 

a 9 corba t e - dependent 

uivJiiv wvyvj ciiatscs proteins , 


BL00084D 25.11 3.700e- 
20 169-224 BLOO084B 

OA *>C ft 1 lAn 1C 1 ft CO 

BL0O0B4C 27.71 8.412e- 
11 107-15B 


1779 


BL01013 


protein family proteins. 


18 611-655 BL01013A 
25.14 2.831e-15 344- 
380 BL01013C 3.97 
6.308e-13 435-445 
BL01013B 11.33 3.717e- 
12 409-420 


1783 


BL00741 


Guanine -nucleotide 
dissociation stimulators 
CJ0C24 family sign. 


BL00741B 14.27 8.l38e- 
13 492-515 


1784 


BL00741 


Guanine - nucl eo t i de 
dissociation stimulators 
CDC24 family sign. 


BL00741B 14.27 8.138e- 
13 492-515 



* results include in order: accession number subtype; raw score; p-value; postion of 
signature in amino acid sequence. 
TRADOCS: 14 16223. J (%CRJ0l LDOC) 
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TABLE 4 



SSQ ID 
NO: 


PFAM NAME 


description 


~ vj^ 


PPAM 


2 




Immunoglobulin domain 


2.le-32 


109.5 


3 


pkinase 


Eukaryotic protein kinase 
domain 


1 . 3e-29 


Tin i 


4 


zf-C2H2 


Zinc finger, C2H2 type 


1.6e-21 


84 . 9 


5 


fn3 


Fibronectin type III domain 


0 


1097 . 1 


6 


fn3 


Fibronectin type III domain 


0 


1035.0 


7 


fn3 


Fibronectin type III domain 


o 


1 non a 


8 


fn3 


Fibronectin type III domain 


0 


1097.1 


9 


TBC 


TBC domain 


4e-40 


14b . / 


10 


p450 


Cytochrome P450 


9.5e-17 


62.0 


12 


auk. 




6e-20 


79.7 


14 


ig 


Immunoglobulin domain 


1.7e-05 


22.7 


15 


yf-MVNn 


nxNu linger 


1. 3e-06 


35.4 


16 




MYND f inger 


1.3e-06 


35.4 


17 


z£-C2H2 


Zinc finger, C2H2 type 


1.7e-99 


343 .9 


18 


CAP_GLY 


CAP-Gly domain 


1.2e-25 


98.7 


20 




IMP dehydrogenase / GMP 
reductase C terminus 


1. Se-119 


410.5 


21 


IMPDH__C 


IMP dehydrogenase / GMP 
reductase C terminus 


4.3e-102 


352.6 


22 




Eukaryotic protein kinase 
domain 


2.4e-79 


277.0 


23 


pkinase 


Eukaryotic protein kinase 
domain 


8.4e-74 


258.6 


25 


A_p O X A 


RNA polymerase alpha subunit 


0 


1077.7 


25 


L.iq 


Clq domain 


1.9e-10 


44.4 


27 


Ribosonial_L»2 
3 


Ribosoroal protein L23 


7. Be-32 


111.2 


28 


Kijuusoniax 
3 


Ribosomal protein L23 


le-29 


104.2 


30 


zf-A20 


«uu"xxa.c Aiuc r my ci 


1 ♦ 5e-l0 


48.5 


31 


zf-A20 


A2 0-1 ike zinc linger 


1.5e-10 


48.5 


32 


PMN dh 


FMN- dependent dehydrogenase 


5.4e-179 


608.1 


34 


PID 


Phospho tyrosine interaction 

QOIDuXIj \irLafcXU} 


3.8e-59 


209.9 


35 


ig 


Immunoglobulin domain 


1.4e-13 


48.8 


36 


icr 

xy 


1 mnuinogl obul in doma in 


1 . 4e-13 


48. a 


40 




Kinesin motor domain 


6 . 7e-76 


265.6 


44 


Ets 


X? t~ f^rwTi^ A y\ 

*>tp viOHlcLXU 


1 . 4e-56 


182.1 


45 


Ets 


fits -domain 


1.4e~56 


182.1 ~~ 


46 


LRR 


Leucine Rich Repeat 


1.7e-13 


58.3 


48 


zf -C?H? 


*»xnc tinger, v^ziiA type 


2 .3e-162 


552 .8 


49 


IT AM 


Iromunoreceptor tyrosine— based 
activation mot 


1.4e-05 


31.9 


50 


OCH-2 


Obi qui tin carboxyl- terminal 
hydrolase family 


x . xe— £o 


102 . 0 


51 


UCH-2 


Ubiquitin carboxyl -terminal 
hydrolase family 


l.le-26 


102.0 


52 


rae 


Ras family 


8 .5e-45 


162 , 3 


53 


PRK i 


Phosphoribulokinase 


2 . le-65 


230 .7 


54 


rayb_DNA- 
binding 


Myb-like DNA-binding domain 


0.096 


15 .2 


55 


voltage_CLC 


Voltage gated chloride channels 


3.3e-186 


631.9 


56 


sugar^tr 


Sugar (and other) transporter 


0.00015 


-64-3 


57 


TBC 


TBC domain 


2.2e-37 


137.6 


58 


ank 


Ank repeat 


5_9e-25 


96.3 


59 


ank 


Anlc repeat 


5.9e-25 


96.3 


67 


PMP22_Claudx 

EL 


PMP-22/BMP/MP20/Claudin family 


7.9e-49 


175.6 


68 ~" 


C2 


C2 domain 


7.9e-S4 


192.2 


69 


C2 


C2 domain 


2.3e-54 


194.0 


70 


Kelch 


Kelch motif 


9.4e-99 


341.5 


72 


ig 


Immunoglobulin domain 


8.2e-28 


94 .7 


73 


pkinase 


Eukaryotic protein kinase 


Be-69 


242.1 
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SBQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p -value 


PFAM 
SCORE 




_ . 


domain 


■ 




74 


pkinase 


Eukaryotic protein kinase 
domain 


2 .8e-38 


140 . 6 


76 


z£- 

C4_Topoi som 


Topoisomerase DNA binding C4 
zinc fing 


5.4e~54 


192,8 


83 


Peptidase_S9 


Prolyl oligopeptidase family 


4,3e-10 


36.8 


84 


fn3 


Fibronectin type III domain 


4 . le-51 


183 . 2 


86 


SH2 


Src homology domain 2 


3 - le-22 


67 . 7 


88 


*9 


Immunoglobulin domain 


0.0091 


14 . 0 


89 


WD40 


WD domain, G-beta repeat 


2 - le-21 


84 . 6 


92 


lamininjG 


I»aminin G domain 


6.1e-27 


98 . 5 ; 


93 


AMP -binding 


AMP -binding enzyme 


2.4e-13 


-37.2 


95 


pkinase 


Eukaryotic protein kinase 
domain 


1.4e-59 


211.4 


96 


pklnase 


Eukaryotic protein kinase 
domain 


2.6e-5l 


183 .9 


97 


adh_3hort 


short chain dehydrogenase 


2e-61 


217.5 


98 


kxneoin 


Kinesin motor domain 


2.2e-86 


300.4 


101 


IRS 


PTB domain (IRS-1 type) 


5-4e-36 


133.0 


102 


AAA 


ATPases associated with various 
cellular act 


6.Be-05 


-5.2 


104 


pkinase 


Eukaryotic protein kinase 
domain 


2.7e-73 


256.9 


106 


ras 


Ras family 


8.3e-24 


92.5 


107 


FYVE 


FYVE zinc finger 


5.4e-27 


100.7 


108 


Cyt__reductas 
e 


FAD/NAD- binding Cytochrome 
reductase 


7.7e-6l 


215.5 


109 


zf-C2H2 


Zinc finger, C2H2 type 


2.3e-122 


420.0 


113 


pkinase 


Eukaryotic protein kinase 
domain 


4e-8B 


306.2 


116 


PH 


PH domain 


3.1e~ll 


45.2 


111 


lipocalin 


Lipocalin / cytooolic fatty- 
acid binding pr 


2.4e~14 


53.5 


118 


pkinase 


Eukaryotic protein kinase 
domain 


4.5e-20 


76.3 


120 


WD40 


WD domain, G-beta repeat 


2.4e-14 


61.1 


121 


WD40 


WD domain, G-beta repeat 


2.4e-14 


61.1 


123 


IP5_eIF4_eIF 
2 


el P4 -gamma /eIF5 / e I ?2 - eps i Ion 


le-32 


122.2 


124 


ig 


Immunoglobulin domain 


6 .Se-08 


30.4 


127 1 


ralto_carr 


Mitochondrial carrier proteins 


3e-l6 


58.6 . 


128 


PP2C 


Protein phosphatase 2C 


2.2e-7l 


250.6 


129 


ATP1G1_PLM_M 
AT8 


ATP1G1/PLM/MAT8 family 


3 .le-20 


80.6 


130 


pfkB 


pfkB family carbohydrate kinase 


4.5e-42 


137.1 


133 


ACBP 


Acyl CoA binding protein 


4 . 6e-22 


86.7 


134 


TCXXtl 


RNA recognition motif. 


1.2e-3i 


118.5 


135 


IQ 


IQ calmodulin- binding motif 


2 .6e-08 


41.0 


136 


ATP1G1 PLM_M 
ATS 


ATP1G1/PLM/MAT8 family 


9 .3e-22 


85 .7 


-tin 

139 


WH2 


Wiskott Aldrich syndrome 
homology region 2 


0 . 0067 


23 . 1 


140 


zf -C2H2 


Zinc finger, C2H2 type 


1 .7e-82 


287 . 5 


141 


Peptidase_jS2 
6 


Signal peptidase 1 


5.7e-10 




35.7 


143 


arf 


ADP - r xbosyl at l on factor family 


1 . 2e-39 




146 


KRAB 




7.3e-30 


112 . 6 


148 


DUP6 


Integral membrane protein DUF6 


0.096 


8.0 


149 


PDEase 


3 151 -cyclic nucleotide 
phosphodiesterase 


3.8e-80 


231.1 


151 


S4 


S4 domain 


l.le-08 


42.3 


153 


tRNA~synt_ld 


tRNA synthetases class I (R) 


3.8e-103 


356.1 


154 


Cytjceductaa 
e 


FAD/NAD-binding Cytochrome 
reductase 


7.8e-60 


212.2 


155 


ras 


Ras family 


3.6e~28 


107.0 


157 


actin 


Actin 


3.8e-26 


87.1 
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SEQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p -value 


PFAM 
SCORE 


158 


Jacalin 


Jacal in-like lectin domain 


0.09 


-24.9 


160 


Zn_carbOpep t 


Zinc carboxypeptidase 


5e-138 


471.9 


165 


pkinase 


Eukaryotic protein kinase 
domain 


5.1e-67 


236.1 


167 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


5.3e-07 


27.0 


168 


Ribosomal_si 
5 


Ribosomal protein S15 


l.le-06 


29.0 


169 


DEAD 


DEAD/DEAH box helicace 


le-48 


157:0 


171 


DUF59 


Domain of unknown function 
DUFS9 


0.07 


-17.4 


172 


pkinase 


Eukaryotic protein kinase 
domain 


3.7e-15 


58.6 


173 


globin 


Globin 


4.6e-18 


67.4 


174 


WW 


WW domain 


7.3e»06 


32.9 


175 


ras 


Ras family 


le-31 


118.8 


178 


ATP1G1_PLM M 
AT8 


ATP1G1/PLM/MAT8 family 


2.5e-17 


71.0 


179 


Zf-C2H2 


Zinc finger, C2H2 type 


1 .5e-99 


344 .2 


180 




Clq domain 


B.8e-72 


251.9 


190 


Yjphosphatas 


Protein- tyrosine phosphatase 


4.9e-287 


967.0 


191 


efhand 


EP hand 


7.5e-l6 


66.1 


193 


|J A- -L I 1<J & t; 


domain 


6.5e-82 


285 . 6 


194 




A rrun r>cl r*m^ i n 

Di l^/IUwV&U&ICa _L 11 


5 .8e-31 


111 . 4 


195 


PALP 


Pyr idoxal -phospha t e dependent 
piizymc 


2.5e-64 


227.1 


197 


DnaJ 


DnaJ domain 


1.6e-38 


141.4 


1 QQ 


RmaAD 


KIDOSunidX tin A dUfctll-LJ- 1? 

dimethylases 


a Artft-tft 

U . UUUlt) 


±o . y 






ai.oLlUJ.llt: aLIU I 1LJ£»£JI la L- <X i»C 


~2~5e-10 


37 .2 


201 


WH2 


Wiskott Aldrich syndrome 
homology region 2 


0 .00048 


26.9 


204 


vATP- 
synt AC39 


ATP synthase (C/AC39) subunit 


1.3e-159 


543.7 


205 


vATP- 
aynt_AC39 


ATP synthase (C/AC39) subunit 


1.6e-139 


476.9 


206 


ldl recept a 


Low-density lipoprotein 
receptor domain 


2.4e-25 


97.6 


209 


ftok 


Ank repeat 


1.4e-19 


78.4 


210 


Rhomboid 


Rhomboid family 


0.0035 


1.2 


211 


Clq 


Clq domain 


1.6e-70 


247.7 


212 


UQcon 


Ubiquit in -conjugating enzyme 


7.4e-74 


258.8 


213 


UQcon 


Ubiquitin- conjugating enzyme 


le-53 


191.9 


215 


DEAD 


DEAD/DEAH box helicase 


1.8e-43 


140.4 | 


216 


PMP22_Claudi 
n 


PMP-22/EMP/MP20/Claudin family 


4.5e-21 


83.4 


21B 


Glycos trans 
f_2 


Glycosyl transferases 


4e-21 


63.6 


219 


ig 


Immunoglobulin domain 


0.092 


10.7 


222 


WD4D 


WD domain, G-beta repeat 


7 . 4e-23 


89.4 


-2 _ 24 


TPR 


TPR Domain 


1.2e-08 


42.1 


225 * 


DnaJ CXXCXGX 
0 


DnaJ central domain (4 repeats) 


1.5e-38 


141.5 


226 


DnaJCXXCXGX 
G 


DnaJ central domain {4 repeats) 


1.5e-38 


141.5 


229 


HSP70 


Hsp70 protein 


2.4e-54 


194.0 


230 


GSHPx 


Glutathione peroxidases 


3.4e-47 


170.2 


231 


tsp_l 


Thrombospondin type l domain 


0.0075 


17.1 


233 


cyclin 


Cyclin 


4 .6e-144 


492.0 


234 


ras 


Ras family 


4.8e-50 


179.7 


235 


LRR 


Leucine Rich Repeat 


1 .2e-30 


115.3 


236 


LRR 


Leucine Rich Repeat 


6 .7e-29 


109.4 


237 


PDZ 

V 


PDZ domain (Also known as DHR 
or GLGP) . 


1.7e-09 


45.0 
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SEQ ID 
NO; 


PFAM NAM 3 


DESCRIPTION 


p-value 


PFAM 
SCORE 


244 


dCMP_cyt_dea 
m 


Cytidine and deoxycytidylate 
deaminase 


2_5e-05 


31.1 


245 




Immunoglobulin domain 


6.7e-08 


30.5 


248 


wnt 


wnt family of developmental 
signaling protei 


9.1e-270 


742.6 


250 


mito_carr 


Mitochondrial carrier proteins 


1.3e-55 


193.6 


254 


adenylatekin 
ase 


Adenylate kinase 


1.8e-14 


55.7 


255 


Cation__efflu 

X 


Cation efflux family 


2.8e-33 


124.0 


256 


SH3 


SH3 domain 


3.9e-14 


60.4 


257 


Aa_trans 


Transmembrane amino acid 
transporter protein 


2 .6e-52 


187.2 


258 


adenylatekin 
ase 


Adenylate kinase 


2.1e-110 


380.2 


253 


HIT 


HIT family 


8.2e-07 


25.3 


260 


Bacterial PQ 

Q 


PQQ enzyme repeat 


1.6e-15 


65.0 


262 


proteasome 


Proteasome A- type and type 


6 . 5e-64 


225 . 7 


267 


pkinase 


Eukaryotic protein kinase 
domain 


6 . 3e-27 


101. 0 


270 


filament 


Intermediate filament proteins 


3 .2e-150 


512 .5 


271 


Choline_kina 
se 


Choline/ethanolamine kinase 


2e-67 


237.4 


277 


Ribosoraai S7 


Ribosoraai protein S7p/S5e 


3.3e-20 


80.6 


279 


pkinase 


Eukaryotic protein kinase 
domain 


3 . 3e-77 


269 .9 


280 


WD40 


WD domain, G-beta repeat 


7 m 8e-73 


255 .4 


281 


WD40 


WD domain, G-beta repeat 


7 . 8e-73 


255.4 


284 


zf-DHHC 


DHHC zinc finger domain 


4 . 6e*-24 


93 . 4 


287 


Bxonucleaee 


Exonuclease 


1.4e-67 


238.0 


291 


SAM 


SAM domain (Sterile alpha 
motif) 


0.034 


11.2 


292 


SAN 


SAM domain (Sterile alpha, 
motif) 


0 . 034 


11 . 2 


294 


zf-C2H2 


Zinc finger, C2H2 type 


1 .4e-29 


111.7 


295 


zf-C2H2 


Zinc finger, C2H2 type 


2.2e-125 


430.0 




mito_carr 


Mitochondrial carrier proteins 


4.1e~59 


205.5 


297 | 


ilMG box 


HMG (high mobility group) box 


6.7e-29 


109.4 


302 


GlycoB trans 
£ 4 


Glycosyl transferase 


5e-87 


302 . 5 


304 


tRNA-synt_2 


tRNA synthetases class II (D, K 
and N) 


l.le-84 


294.8 


305 


KRAB 


KRAB box j 


2e-44 


161.0 


306 


rrra 


RNA recognition motif. 


2.7e-44 


160.6 


308 


7tra_l 


7 transmembrane receptor 
(rhodopsin family) 


5.2e-39 


126M 


309 


DNA_polyroera 
seX 


DNA polymerase X family 


2.4e-64 


227.2 


311 


F-box 


F-box domain. 


9.5e-08 


39.2 


312 


±g 


Immunoglobulin domain 


6.8e-l9 


65.9 


313 


Ets 


Ets -domain 


8.1e-60 ! 


192.3 


315 


Kelch 


Kelch motif 


I.3e-106 


367.6 


317 


arf 


ADP-rlbosylation factor family 


3.2e-35 


130.4 


318 


sugar_tr 


Sugar (and other) transporter 


0.0003 


-73.1 


320 


pkinase 


Eukaryotic protein kinase 
domain 


8.1e-83 


288.6 


322 


pkinase 


Eukaryotic protein kinase 
domain 


4.9e-81 


282.6 


324 


Xlink 


Extracellular link domain 


4.5e-143 


331.5 


326 


ARID 


ARID DNA binding domain 


5.1e-37 


136.4 


327 


HMGjbox 


HMG thigh mobility group) box 


6.7e-29 


109.4 


328 


cadherin 


Cadherin domain 


8.1e-81 


281.9 


331 


chromo 


'chromo' (CHRroraatin 
Organization Modifier) 


4e-l8 


66.7 


333 


Peptidase_M2 
2 


Glycoproteaee family 


1.2e-136 


467.4 



249 



WO 01/53312 



PCT/US00/34263 



SEQ ID 
NO? 


PFAM NAME 


DESCRIPTION 


value 


PFAM 
SCORE 


335 


vwa 


von Willebrand factor type A 
domain 


2 .3e-07 


37 . 9 


339 


ras 


Ras family 


7.8e-07 


-59.1 


340 


zf-C2H2 


Zinc finger, C2H2 type 


8.2e-64 


225.4 


342 


2f-C2H2 


Zinc finger, C2H2 type 


2.4e-B5 


297.0 


343 


xg 


Immunoglobulin domain 


O.OD05 


18-0 


346 


pkinase 


Eukaryotic protein kinase 
domain 


6.5e-65 


229.1 


347 


pkinase 


Eukaryotic protein kinase 
domain 


6-.5e-6S"' 


229.1 


351 


EGF 


EGF- like domain 


8.5e-20 


79.2 


352 


ank 


Ank repeat 


2.5e-10l 


350.0 


354 


TBC 


TBC domain 


5.1e-15 


63.3 


355 


PHD 


PHD- finger 


3 .2e-07 


37.4 


358 


DUF6 


Integral membrane protein DOF6 


0.033 


15 . 8 


359 


Zf-C2H2 


Zinc finger, C2H2 type 


7.4e-20 


79.4 


361 


ank 


Ank repeat 


6 .6e~34 


126.1 


362 


ArfGap 


Putative GTP-ase activating 
protein for Arf 


4,7e-53 


189.7 


363 


ef hand 


BF band 


5 .4e-10 


46 . 6 


367 


LRR - 


Leucine Rich Repeat 


8. de-44 


158 .3 


368 


laminin_G 


Laminin G domain 


1.5e-33 


121.7 


369 


PP2C 




5 . 3e-20 


73 . 9 


372 


LIM 




9 . 9e-15 


57.1 


373 


KRAD 




4 . 8e-23 


90 . 0 


376 






2 . 9e- 09 


-4.2 


377 




Dai rrA / Hl7lif* T U rlnms'iiri 
otrxy t;/ omu-n uuiiia-Lii 


"* * JXZ A \J o 


704 , 5 


380 






1 . 6e- 94 


327.5 


381 


AMP-bindinci 


AMP-hi nrl "i ttct on ?vmf* 


1.4e-07 


-140 .3 


382 


HECT 


HECT-domain (ubiquitin- 
transferase) 


1.3e-07 


-13.5 


384 


ank 


Ank repeat 


2.5e-101 


350 „ 0 


386 




Immunoglobulin domain 


9 . 5e-0S 


23 . 6 


388 


zf-C2H2 


Zinc finger, C2H2 tyoe 


i.7e-42 


154 -6 


389 


iQ 


Immunoglobulin domain 


2 .8e-15 


54 . 3 


390 


mi to carx 


Mitochondrial carrier proteins 


3 ,5e-67 


233 . 2 


392 


TPR 


TPR Domain 


6.1e-17 


69 .7 


393 


SH3 


SH3 domain 


3 .5e-09 


43 . 9 


394 


AAA 


ATPases associated with various 
cellular act 


4.1e-2l 


83 .6 


396 


spectrin 


Spectrin repeat 


2.1e-67 


237.3 


397 


zi-C2H2 


Zinc finger, C2H2 type 


0.0066 


23.1 


399 


fn3 


Pibronectin type III domain 


4.1e-102 


352.6 


400 


WD40 


WD domain, G-beta repeat 


0.00049 


26.8 


401 


El dehydrog 


Dehydrogenase El component 


3e-119 


409.6 


402 


£n3 


Pibronectin type III domain 


0 


1719.6 


404 


LRR 


Leucine Rich Repeat 


2.1e-10 


48 .0 


40S 


cadherin 


Cadherin domain 


8.ie-8i 


281.9 


406 


zf-CXXC 


CXXC zinc finger 


5e-15 


63.4 


410 


RhoGEF 


RhoGEF domain 


i.le-23 


92.1 


411 


F-box 


F-box domain. ! 


4.2e-06 


33.7 


412 


SNF2_N 


SNF2 and others N- terminal 
domain 


5.8e-16 


61.6 


415 


CPSase L cha 
in 


Carbamoyl -phosphate synthase 
(CPSase) 


1.5e-172 


586.6 


418 


LRR 


Leucine Rich Repeat 


3.8e-24 


93.6 


419 


DENN 


DENN (AEX-3) domain 


2e-58 


207. S 


420 


RasGEF 


RasGEF domain 


8.1e-43 


155.7 


421 


ank 


Ank repeat 


1.4e-153 


523.7 


424 


G -patch 


G- patch domain 


le-19 


78.9 


425 


pkinase 


Eukaryotic protein kinase 
domain 


2.2e-31 


117.1 


426 


Plexin repea 
t 


Plexin repeat 


0.0023 


24.6 


427 


Plexin_repea 


Plexin repeat 


0.0023 


24.6 
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SKQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p -value 


PFAM 
SCORE 




C 








429 


zf-C3HC4 


Zinc finger, C3KC4 type (RING 
finger) 


8.6e-ll 


39.2 


431 


DEAD 


DEAD/DEAH box helicase 


le-66 


214.0 


432 


SH3 


SH3 domain 


3 ,4e-l6 


67.2 


433 


GTP CDC 


Cell division protein 


2*le-114 


393.5 


436 


Collagen 


Collagen triple helix repeat 
(20 copies) 


4 . 6e-194 


558. 1 


436 


Ricin_B_lect 
in 


Similarity to lectin domain of 
ricin b 


0 .0085 


10 _ 5 


441 


Alpha adapt i 
n_C 


Alpha adapt in carboxyl- terminal 
domai 


1 ,2e-256 


866 .0 


442 


Alpha_adapt i 
n_C 


Alpha adapt in carboxyl- terminal 
domai 


1. Be- 235 


795.7 


443 


PDZ 


PDZ domain (Also known as DHR 
or GLGF) . 


1. 9e-65 


230.9 


445 


LON 


ATP-dependent protease La (LON) 
domain 


0.00012 


-17.1 


446 


ig 


Immunoglobulin domain 


0 . 00011 


20 .1 


m 451 


sushi 


Sushi domain (SCR repeat) 


1.4e-18 


75.2 


452 


fn3 


Fibronectin type III domain 


1 .5e-06 


35.2 


454 


pyridoxal_de 
C 


Pyridoxal -dependent 
decarboxylase conse 


8 .3e-14 


50.3 


456 


kinesin 


Kinesin motor domain 


4 .9e-217 


734.4 


457 


neur_chan 


Neurotransmitter -gated ion- 
channel 


le-175 


597.1 


458 


Josephin 


josephin 


0.0002 


18.7 


468 


bziP 


bZIP transcription factor 


1.7S-07 


31.8 


470 


NTP_transfer 
ase 


Nucleotidyl transferase 


6.3e-06 


-26.3 


471 


WD40 


WD domain, G-beta repeat 


2e-28 


107.9 


473 


LIM 


LIM domain containing proteins 


0.00021 


20.7 


477 


zf-RanBP 


Zn- finger in Ran binding 
protein and others. 


0.028 


21.0 


473 


HD40 


WD domain, G-beta repeat 


6.5e-18 


73.0 


480 


KRAB 


KRAB box 


le-31 


11B.8 


481 


ArfGap 


Putative GTP -ase activating 
protein for Arf 


B.4e-66 


232.0 


485 


SH2 


Src homology domain 2 


0.011 


11.4 


486 


Clg 


dq domain 


4.3e-74 


259.6 


487 


dsrm 


Double- stranded RNA binding 
motif 


l.le-47 


171.9 


489 


zf-C2H2 


Zinc finger, C2H2 type 


4.8e-153 


521.9 


490 


Alpha_adapti 
njC 


Alpha adapt in carboxyl-terminal 
domai 


3.4e-222 


751.6 


492 


SKI 


Shikimate kinase 


1.2e-10 


48.8 


497 


KNV__polyprot 
ein 


ENV polyp rotein (coat 
polyp rot ein) 


2.6e~22 


77 . 6 


496 


abhydrol ase_ 
2 


Phosphol ipas e / Carboxyl es t erase 


0.041 


-48 . 1 


500 


rrm 


RNA recognition motif. 


5.4e-34 


126.4 


501 


WW 


WW domain 


4 . 6e-18 


73 . 4 


502 


ig 


Immunoglobulin domain 


l.le-10 


39.5 


504 


abhydrol aae 


alpha /beta hydrolase fold 


0 .045 


— ^—p 

-3.o 


505 


vwa 


von Willebrand factor type A 
domain 


7.le-62 


219.0 


508 


Na_K_ATPase_ 
C 


Na+/K+ ATPase terminus 


2.3e-145 


496.3 


509 


Exonuclease 


Exonuclease 


1.3e-56 


201.5 


510 


Glycos trans 
£ 1 


Glycosyl transferases group 1 


2.9e-06 


27.0 


511 


Glycos trans 
£ 1 


Glycosyl transferases group 1 


2.9e-06 


27,0 


512 


Glycos trans 


Glycosyl transferases group 1 


1.9e-09 


38.5 


514 


pro_asomeras 
e 


Cyclophilin type peptidyl- 
prolyl cis-tr 


1.8e-63 


221.4 
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SEQ ID 
NO: 


PPAM NAME 


DESCRIPTION 


p- value 


PFAM 
SCORE 


515" 


EGF 


EGF- like domain 


1.9e-18 


74 ,1 


SI 6 


Surp 


Surp module 


4.3e-38 


140.0 


523 


*9 


Immunoglobulin domain 


3.3e-06 


25.0 


526 


UBX 


UBX domain 


l.le-34 


128.6 


528 


adh_zinc 


Zinc-binding dehydrogenases 


2,7e-34 


127.4 


530 


SAM 


SAM domain (Sterile alpha 
motif) 


0.046 


10.0 


531 


adhjshort 


short chain dehydrogenase 


0.0025 


-34.1 


532 


mltq_carr 


Mitochondrial carrier proteins 


2.5e-81 


281.7 "* 


533 


mito_carr 


Mitochondrial carrier proteins 


2e-6l 


213 .5 


534 


thiolase 


Thiolase 


3.5e-183 


622 . 0 


535 


FMO-liXe 


Flavin-binding monooxygenase- 
like 


0 


1153 . 7 


536 


SCAN 


SCAN domain 


4e-55 


196". 6" 


537 


tRNA-synt_l 


tRNA synthetase? class I (I, I>, 
M and V) 


3.1e-136 


466.0 


538 


tRNA-synt 1 


tRNA synthetases class I (I, l* t 
M and V) 


3 .le-136 


466 . 0 


. 539 


tRNA-synt_l 


tRNA synthetases class I (I, L, 
M and V) 


1 . 9e-117 


403.6 


540 


tRNA-syntl 


tRNA synthetases class I (I, L, 
M and V) 


3 -le-136 


466 . 0 


541 


vATP-synt_E 


ATP synthase (E/31 KDa) subunit 


5.9e-B5 


295.7 


543 


zf-C2H2 


zinc finger, C2H2 type 


5.5e-£9 


242 


544 


DUP101 


Protein of unknown function 
DUF101 


8.5e-38 


13 9 . 0 


545 


TGFb_propept 
ide 


TGF-beta propeptide 


l.le-67 


238.2 


547 


WD40 


WD domain, G-be^a repeat 


2 . 6e~32 


120 8 


548 


RHD 


Rel homology domain (RHD) 


■ 1. 6e-238 


6B6.2 i 


549 


MMR_HSR1 


GTPase of unknown function 


5.4e-67 


236.0 


551 


HECT 


HECT -domain (ubiquitin- 
transferase) . 


4 . 3e-127 


435 6 


554 


MHCJ£I_alpha 


Class II histocompatibility 
antigen, alp 


3 . 5e-74 


259 _ 8 


555 


Zf-UBRl 


Putative zinc finger in N- 
recognin 


3.3e-16 


67.3 


556 


Kelch 


Kelch motif 


5.5e-29 


109.7 


561 


AMP -binding 


AMP -binding enzyme 


2 . 8e-06 


-l£3'."7 


562 


PABP 


Poly- adenylate binding protein, 
unique domai 


4 . 9e-38 


139.8 


564 


Gag_p3D 


Gag P30 core shell protein 


1.2e-67 


23 8 .2 


566 


PWWP 


PWWP domain 


B.le-16 


66.0 


567 


SCAN 


SCAN domain 


7.3e-68 


238.9 


569 


pkinase 


Eukaryotic protein kinase 
domain 


1.5e-84 


294.3 


570 


pkinase 


Eukaryotic protein kinase 
domain 


1.5e-84 


294.3 


571 


CN hydrolase 


Carbon-nitrogen hydrolase 


0.00081 


-79.? 


572 


myosin_Jhead 


Myosin head (motor domain J 


0 


1495.2 " 


573 


myosin_Jiead 


Myosin head (motor domain) 


0 


1490.4 


575 


Surp 


Surp module 


1.7e-23 


91.5 


576 


Surp 


Surp module 


1.7e-23 


91.5 


577 


DNAjpol B 


DNA polymerase family B 


0 


1138.6 


"578 


PDZ 


PDZ domain (Also known as DHR 
or GLGF) . 


8.3e-09 


42.7 


579 


LRR 


Leucine Rich Repeat 


4.9e-21 


83.3 


580 


neur_chan 


Neurotransmitter-gated ion- 
channel 


5.9e-177 


601.3 


583 


sushi 


SuBhi domain (SCR repeat) 


0 


1673.0 


584 


DEAD 


DBAD/DEAH box helicase 


7.3e-36 


116.3 


586 


KH-domain 


KH domain 


2.9e-13 


57.5 


587 


G-patch 


G-patch domain 


2.3e-14 


61.2 


599 


LIM 


LIM domain containing proteins 


2.3e-36 


133.4 


590 


broraodomain 


Bromodomain 


6.6e-32 


114.7 


591 


broraodomain 


Broraodomain 


6.6e-32 


114.7 
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SEQ XD 


AM NAME 


DB£>CRI trl IUP» 


p- value 


PFAM 
SCORE 


592 


hormone_rec 


Itigand- binding domain of 


3.5e-22 


87.1 


593 


PHD 


PHD- finger 


3.8e-12 


53.8 


594 


Ccidherin 


Cadherin domain 


4 . 2e-99 


342 . 7 






Eukaryotic protein kinase 

ucKrict 


t_ -Q'j 

be-92 


319 .2 


597 


WD40 


WLI UOniclX*! , Vj — J3«Ccl repeat 


n nnn^j 


to . / 


600 




f u-uftr repeat 


4 . je- /:> 


ADA . y 


602 


G_Adapt_CT 


Gamma -adapt in, C- terminus 


l.le-53 


191.8 


603 


pkioase 


Eukaryotic protein kinase 
domain 


2 . 3e-86 


300 . 4 


605 


Collagen 


Collagen triple helix repeat 
(20 copies) 


8e-42 


152 .4 


cnc 
oUo 


mxto car~ 


Mitochondrial carrier proteins 


6 ,3e-67 


232.3 


608 


PWWP 


pwwp domain 


2 . 6e-28 


107 - 5 


609 


PWWP 


PWWP domain 


2.6e-28 


107.5 


613 


CAP _G1»Y 


CAP-Gly domain 


0.0046 


20 .1 


615 


RFX_DNA_bind 
ing 


RFX DNA- binding domain 


S.2e-54 


192.9 


616 


kinesin 


Kinesin motor domain 


l.le-81 


284.8 


Sl7 


kinesin 


Kinesin motor domain 


8.4e-80 


278.5 


618 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


0-0098 


13.1 


620 


MATH 


MATH domain 


7.8e-05 


22.2 


621 


Y_phosphatas 
e 


Protein- tyrosine phosphatase 


1.4e-32 


121.6 


622 


pkinaae 


Eukaryotic protein kinase 
domain 


4.4e-40 


146.6 


623 


BNR 


BNR repeat 


2.1B-11 


51.3 


624 


molybdopteri 
n 


Prokaryotic raolybdopterin 
oxidoreductas 


1.4e-12 


42.2 


625 


TPR 


TPR Domain 


l.le-17 


72.2 


627 


cNMPJbinding 


Cyclic nucleotide-binding 
domain 


3.7e-58 


206.6 


630 


adh_short 


short chain dehydrogenase 


5e-17 


70.0 


631" 


zf -C2H2 


Zinc finger, C2H2 type 


2.1e-88 


307.1 


632 


rrm 


RNA recognition motif . 


4e-05 


30.5 


635 


pkinase 


Eukaryotic protein kinase 
domain 


1.6e-104 


360.7 


636 


Fork_head 


Fork head domain 


5.9e~27 


103 .0 


637 


pkinaae 


Eukaryotic protein kinase 
domain 


3 -8e-70 


246.5 


642 


TPR 


TPR Domain 


4.8e-08 


40.1 I 


643 


einand. 


BF hand 


1 . 9e-27 


104 .6 


647 


SNr2__N 


SNF2 and others N- terminal 
domain 


1.2e-101 


351 .1 


64S 


PS eudoU_syn t 
h 2 


RNA poeudouridylate synthase 


1.9e-55 


197.6 


650 




linger, type 


0 . 0087 


22 . 7 


651 




hlU. IcpcaU 


1 " -S" -TW 


/l . J* 


652 


I LWEQ 


T / T > WPin Hnmn i n 
i/unciU uviudin 


9 i 5e-101 


J*± X . u 


653 


noilr r«Vi^»rt 


ivtiux u^iUii3iiu.LLcr cfaceu ion— 

r* Vi ?i ti ti#» 1 

^jllCUli its 


4 . le-171 


oox . D 


654 


tsp 1 


ThronihOfittrtiifii t\ hvnp 1 flnmairi 


4 ie-47 


169 . 9 


659 


FH2 


Pormi n Homo 1 nau 2 rinmn i n 


le-107 


371.2 


661 


pou 


Poll Hramin _ H-hArmin/)1 Frt 

*VU UXJiltcf_LJA — XV— LCXUiXIlclX CO 

homeobox domain 


5 . 3e-45 


162 . 9 


662 


C2 


C2 domain 


6.7e-19 


76.2 


663 


C2 


C2 domain 


6.7e-19 


76.2 


664 


C2 


C2 domain 


6.7e-19 


76.2 


667 


GST 


Glutathione S- transferases. 


9.3e-34 


114.4 


668 


LRR 


Leucine Rich Repeat 


9.3e-31 


115.6 


670 


spectrin 


Spectrin repeat 


4e-57 


203.2 


*71 


I_LWEQ 


IVLWEQ domain 


9.5e-101 


341.0 


672 


ABC tran 


ABC transporter 


5.3e-60 


212.8 


674 


WD40 


WD domain, G-beta repeat 


4.8e-24 


93.3 
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SEQ ID 
NO: 


PPAM NAME 


DESCRIPTION 


p- value 


PFAM 
SCORE 


675 


WD40 


WD domain, G-beta repeat 


4 .8e-24 


93.3 


676 


I.RR 


Leucine Rich Repeat 


0.0015 


25. 2 


679 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H 
type 


2.6e-29 


107.7 


680 


zf-C2H2 


Zinc finger, C2H2 type 


5.2e-05 


30.1 


681 


CH 


Calponin homology (CH) domain 


2.4e-17 


71.1 


682 


DSPc 


Dual specificity phosphatase, 
catalytic doma 


4.3e-43 


156.6 


683 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


0.051 


10.8 


687 


Synapsis 


Synapsin 


0 


1890 -8 


689 


PR55 


Protein phosphatase 2A 
regulatory subunit PR 


0 


1038 .8 


691 


homeobox 


Homeobox domain 


B.5e-30 


112.4 


696 


Peptidase_M2 
4 


metal lopeptidase family M24 


2.6e~S9 


210 . 5 


697 


RhoGEF 


RhoGEF domain 


9.5e-35 


128.9 


698 


PHD 


PHD- finger 


0.008 


9.3 


701 


zf-C2H2 


Zinc finger, C2H2 type 


5.5e-123 


422,0 


702 


Sulf ataae 


Sulfatase 


3e-231 


781.6 


703 


zf-C2H2 


Zinc ringer, C2H2 type 


5.7e-20 


79.8 


707 


Acyl_transf 


Acyl trans f erase domain 


l.le-22 


68.8 


708 


WIMO 


WD domain, G-beta repeat 


4 .8e-l9 


76.7 


710 


Ran_BPl 


RanBPl domain. 


8.4e-06 


-7.3 


713 


DEAD 


DEAD/DBAH box helicase 


9.9e-42 


134.9 


714 


PH 


PH domain 


l.fie-09 


39.0 


715 


DSPc 


Dual specificity phosphatase, 
catalytic doma 


1.5e-37 


138.2 


717 


Sialyltranaf 


Sialyltransferase family 


7.5e-31 


115.9 


718 


ig 


Immunoglobulin domain 


le-29 


100.8 


719 


integrin_B 


Integrins, beta chain 


0 


1125.4 


720 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


l.le-08 


32.4 


722 


Peptidase_C2 


Calpain family cysteine 
protease 


3e-14S 


495.9 


723 


ig 


Immunoglobulin domain 


2 .2e-05 


22.4 


724 


F-box 


F-box domain. 


0.007 


23.0 


725 


Nop 


Putative snoRNA binding domain 


8.ie-58 


205.5 


726 


Nop 


Putative snoRNA binding domain 


8.1e-58 


205.5 


727 


HD40 


WD domain, G-beta repeat 


7.5e-26 


99.3 


730 


dsrra 


Double- stranded RNA binding 
motif 


0.027 


12.1 


731 


dynamin 


Dynamin family 


4 .2e-16 


66.9 


733 


Zf-CCCH 


Zinc finger C-xb-c-x5-C-x3-h 
type 


2.8e-10 


41. 7" 


73 S 


CDP- 

OH_P_tranaf 


CDP- alcohol 

phosphatidyl transferase 


4.2e-26 j 


100.1 


738 


DEAD 


DEAD/DEAH box helicase 


8.6e-57 


182.5 


739 


TSC22 


TSC-22/dip/bun tamily 


6.Se-32 


119.5 


742 


ras 


Ras family 


2 .2e-10O 


346.9 


743 


PMI_typeI 


Phosphomannosc isomerase type I 


1.2e-243 


822.9 


747 


trypsin 


Trypsin 


6 .4e-88 


279 .4 


748 


kazal 


Kazal -type serine protease 
inhibitor domain 


2 .2e-52 


187 .4 


749 


efhand 


EP hand 


6 -3e-06 


33 . 1 


751 


PHD 


PHD- finger 


4 .9e-16 


66 .7 


752 




4iinc c ii lyer, \~&xi£ uype 




83 9 


753 


Hydrolase 


haloacid dehalogenase-Iike 
hydrolase 


6.1e-ll 


49.8 


754 


Ribosomal_L3 
9 


Ribosomal L39 protein 


0.00018 


26.7 


75S 


PH " 


PH domain 


3.6e-l4 


55.7 


758 


SCAN 


SCAN domain 


1.4e-53 


191.5 


759 


PA 


PA domain 


0.0065 


23.1 


760 


arf 


ADP-ribosylation factor family 


2.2e-l9 


77.8 


761 


CIDB-N ! 


C1DE-N domain 


2.2e-40 


147.6 
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SEQ ID 
NO: 


FFAM NAME 


DESCRIPTION 


p -value 


PFAM 


762 


histone 


Core histone H2A/H2B/H3 /H4 


9 . 9e-53 


JV.OO . O 


763 


zf-MYND 


MYND finger 


4 . le-14 


60 .3 


764 


pou 


Pou domain - N- terminal to 
home oho x domain 


le- 52 


J.OO t O 


767 


vwc 


von Willebrand factor type C 
domain 


2.9G-34 


127 .3 


769 


ef hand 


BF hand 


4.8e-ll 


50.1 


770 


zf-C4 


Zinc finger, C4 type (two 
domains ) 


2 .4e-53 


181 .6 


772 


ras 


Ras family 


7e-90 


312.0 


773 


Sulf atase 


Sulf atase 


le-142 


487.5 


77S 


zf -C2H2 


Zinc finger, C2H2 type 


l.le-12 


55.5 " 


776 


ZxT-C2H2 


Zinc finger, C2H2 type 


1 .le— 12 


55 . 5 


777 


zf-C2H2 


zinc finger, C2H2 type 


1 . le-12 


55 5 


778 


crm 


RNA recognition motif. 


2.1e-32 


121 1 ~ 


779 


G6PD 


1 nrri^F* — — riho^rvKa t* » 

dehydrogenase 


X . DC /O 


236 . 6 


780 


spectrin 




3.7e-29 


110 . 3 


781 






4 . 6e-57 


198 . 5 


782 


SCAM 


O V-JAil kXLHIUZ.XXX 


X . 


95 . 2 


783 


PDZ 


ku£* uQtiicixu \/visQ \nown as l/ixk 


4 . le-07 


37 . 1 


785 


DEAD 


DEAD/DEAH box heli case 


6e-Q6 


21.7 j 


786 


ras 


nets i- dim. j.y 


5 . 3e-39 


143 . 0 


787 


RKJane HIT 


K AJ-HJIllXCXCiJiJt* nil 


2 . 5e-67 


237 . 1 


730 


rij rut Jt ma 

3S 


Phosphatidyl inositol 3*- and 4- 

V" i cf>a 


5 .4e-l08 


372 .2 


795 


caclhf»T"1 n 


ParthpHn <"l#"wn;i "f yi 

^ fflUilCLXll UUUlaJ.Il 


2 5e-40 


147 . 4 


796 


ARID 


ARID DNA binding domain 


1.66-20 


81.6 


797 




Trypsin 


9 ■ 9e-20 


64 . 8 


799 


CH 


Calponin homology (CH) domain 


3 -7e-15 


63 . 8 


801 


Gal- 

bind lectin 


vertejorate gdiactosiue'-Dinainy 
lectin 


4 . le-25 


88 . 7 


803 


WD40 


WD domain, G-beta repeat 


0.00082 


26.1 


806 


TBC 


luO kXUUICtX*XX 


J. • oe*"zb 


101 . 4 


807 


TBC 


iuu uocnain 


1 . 8e-26 


101 . 4 


808 


CNhydrolase 


Carbon-nitrogen hydrolase 


8.8e-80 


278. S 


811 


r*HFD NPVR HM 

l-O* U .MMr I X3 Xli'J 
P 


ri-X4» LOue^XUCe LxoJlSCrxpclOR 

factor 


be- 14 


59 . 8 


812 


auii DiiwL l 


snore cixain txenycurogenase 


8.1e-20 


79 . 3 


814 


IMP4 


Domain of unknown function 


3 . 3e~71 


250 . 0 


815 


zf-C2H2 


Zinc finger, C2H2 type 


8.2e-66 


232.1 


816 


P«int* t-VUft "htr 
ir«5£Jt___t KRn Aiy 

dro 


Pep t idyl -t RNA hydrolase 


l.be-37 


13 6 . 0. 


817 


ARID 


ARID DNA binding domain 


2.5e-18 


74.3 


826 


IPS eIF4 elF 
2 ~ ~~ 


aTpA «- fTaiimia / #>TK*tv /p7 — on a 4 1 r~m 
CXX T* ydOOnd/cxC3/Cir« cpoHLUl 


1 CTc» _ -> O 

x • oe-.j<it 


121 . 5 


830 


ArfGap 


Pufcahivf* CJTP-anp art" i vat" i nrr 
protein for Arf 


1 5e^q^l 


1:7 -L, . 3 


831 


LRR 


Leucine Rich Repeat 


2 . le-26 


1U1 . 1 


832 


laminin EGF 


Laminin EGF- like (Domains III 
and V) 


2e-57 


204 2 


839 


rrm 


RNA recognition motif. 


1.3e-22 


88 . 5 


840 


Y_phosphatas 
e 


Protein- tyrosine phosphatase 


2.6e-H9 


409! 8 


841 


pkinase 


Rukarvof ic nrrifpln Icina^f* 

uum4X JULAV- ^VwCXil AJ41CIDC 

domain 


3 . 46-100 


jlo ■ J 


844 


Ribosomal L2 
2e 


Ribosomal I»22e protein family 


le-eU 


228.4 


846 


IBR 


IBR domain 


9e-15 


62.5 


849 


zr-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


7.46-07 


26.5 


850 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


0.00016 


18.9 


851 


SET 


SET domain 


5e-30 


113.2 


852 


SRCR j 


Scavenger receptor cysteine - 


0 


1025.4 
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SEQ ID 
NO: 


PPAM NAME 


DESCRIPTION 


p- value 


PFAM 
SCORE 






rich domain 






853 


SRCR 


Scavenger receptor cysteine- 
rich domain 


0 


1025.4 


857 


lactaraa3e_B 


Metal lo-beta- lactamase 
super family 


0.012 


-6.0 


858 


COX6A 


cytochrome c oxidase subunit 
Via 


3.4e-58 


206.7 


B59 


rrm 


RNA recognition motif. 


5.4e-45 


162.9 


. 861 


PRK 


Phosphoribulokinase 


5.ie-62 


219 .4 


863 


mito_carr 


Mitochondrial carrier proteins 


2.9e-53 


185.5 


864 


HSP90 


Hsp90 protein 


4 . 7e-l58 


538 .5 


866 


A 9 


Immunoglobulin domain 


4e-12 


44 . 1 


867 


zf-C2H2 


Zinc finger, C2H2 type 


7e-135 


461.5 


872 


hi stone 


Core hlstone H2A/H2B/H3/H4 


4 ,9e-41 


149 8 


874 


CpSaseL cha 
in 


Carbamoyl -phosphate synthase 
(CPSase) 


Cm . JLtS *" 4> X O 


739 . 0 


879 


Ribosomal SI 
2e 


Ribosomal protein S12e 


2.1e-98 


340.3 


882 


serpin 


Serpins (serine protease 
inhibitors) 


2.5e-42 


145.7 j 


883 


Patatin 


Patatin 


1 . 2e-51 


182 0 


884 


RA 


Ras association (RalGDS/AF-6) 
domain 


0 044 


8.0 


887 


DUF92 


Integral membrane protein DUF92 


2 . 7e-12 


54 . 3 


889 


sugar_tr 


^ugar (and other) transporter 


8 . 2e-63 


222 . 1 


893 


DUF28 


Domain of unknown function 
DUP28 


1 . 3e~43 


ICQ 1 


896 


lP_trans 


Phosphatidylinositol transfer 
protein 


6 . 5e- 98 


338 . 7 


898 


DEAD 


DEAD/DEAH box helicase 


1. 5e-48 


156 .5 


899 


KK2 


KE2 family protein 


7e-61 


21S .7 


900 


KB 2 


KB2 family protein 


4.3e-5l 


183.2 


901 


zf-C2H2 


Zinc finger, C2H2 type 


2 ,7e-57 


203 . 8 


902 


ras 


Ras family 


2 .3e-75 


263 . 8 


904 


TPR 


TPR Domain 


3 . 2e-22 


87 . 2 


906 


GBP 


Guanylate-binding protein 


8 . 9e-253 


853 . 1 


907 


GBP 


Guanylate-binding protein 


l.le-239 


809.6 


90B 


WD40 | 


WD domain, G-beta repeat 


2 .6e-26 


100 . 8 


909 


PH 


PH domain 


1.3e-09 


39.4 


910 


zf-C2H2 


Zinc finger, C2H2 type 


2.5e-39 


144 . 1 


913 


Epimerase 


NAD dependent 

epimerase/dehydratase family 


5e-07 


-88 .5 


921 


TBC 


TBC domain 


1 .5e-09 


30.7 


922 


WD40 


WD domain, G-beta repeat 




98.2 


923 


WD40 


WD domain, G-beta repeat 


8 .2e-07 


36 .1 


924 


Hydrolase 


haloacid dehalogenase-like 
hydrolase 


2.9e-05 


29,1 


925 


OQ con 


Ubiquitin-conjugating enzyme 


0.00033 


-27.6 


926 


CH 


Calponin homology (CH) domain 


3.3e~53 


190.2 


928 


WD40 


WD domain, G-beta repeat 


5.9e-48 


172 .7 


929 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


3.1e-10 


37.4 


930 


Ribul_P_3_ep 
im 


Ribulose-phosphate 3 epimerase 
family 


7.2e-105 


361.8 


931 


Ribul_p_3_ep 
ira 


Ribulose-phosphate 3 epimerase 
family 


1.2e-96 


334.4 


936 


C2 


C2 domain 


2.2e-62 


220.7 


937 


NAP__family 


Nucieosome assembly protein 
(NAP) 


l.le-22 


84.6- 


940 


abhydrolase 


alpha/beta hydrolase fold 


0.011 


3.1 


944 


Tropomyosin 


Tropomyosins 


3.2e-07 


25.1 


948 


pkinase 


Eiticaryotic protein kinase 
domain 


3.4e-75 


263.2 


949 


WD40 


WD domain, G-beta repeat 


1.8e-27 


104.7 


950 


Acyl transfer 
ase 


Acyl transferase 


1.6e-07 i 


38.4 
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SEQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p-value 


PFAM 
SCORE 


951 


SAM 


SAM domain (Sterile alpha 
motif) 


0.014 


14 .5 


954 


GFO_IDH MocA 


Oxidoreductasa family 


l-.3e-ll 


52 » 0 


955 


BTB 


BTB/POZ domain 


7e-22 


86.1 


956 


BTB 


BTB/POZ domain 


7e-22 


-v-s — 

86 .1 


957 


CDP- 

OH P transf 


CDP- alcohol 

phosphatidyl transferase 


0.O53 


-22 . 2 


959 


ras 


Ras family 


2.4e-97 


336.8 j 


960 


ras 


Ras family 


8.4e-43 


155 . 6 


961 


Acetyl transf 


Acetyl transferase (gnat) family 


1.2e-08 


42.2 


962 


adh_short 


short chain dehydrogenase 


2.4e-31 


117.6 


963 


mutT 


Bacterial mutT protein 


5.6e-06 


26.2 


969 


IF-2B 


Initiation factor 2 subunit 
family 


8»4e-193 


653.9 


970 


RNaseJPH 


3' exorlbonuclease family 


9e-24 


92.4 


975 


WW 


WW domain 


5.7e-2S 


96.4 


977 


PDZ 


PDZ domain (Also known as DHR 
or GLGF) . 


3.6e-21 


83.7 


978 


Ribosomal Ll 
7 


Ribosomal protein L17 


2.4e-20 


81.0 


979 


LIM 


LIM domain containing proteins 


5.8e-42 


152.8 


980 


Calsequestri 
n 


Ca 1 seques tr in 


1.7e-297 


1001.7 


982 


HSP20 


Hsp20 /alpha crystallin family 


1.2e-10 


43.2 


983 


oxidored__q6 


KADH ubiquinone oxidoreductase, 
20 Kd sub 


4.8e-63 


222.9 


988 


TBC 


TBC domain 


2.2e-50 


180.8 


989 


TBC 


TBC domain 


2.2e-50 


180.8 


993 


tRNA_int_end 
o 


tRNA intron endonuclease 


0.0017 


-34.2 


994 


homeobox 


Homeobox domain 


4e-l8 


73.6 


997 


pyr redox 


Pyridine nucleotide-di sulphide 
oxidoreducta 


0.012 


11.6 


1000 


mito carr 


Mitochondrial carrier proteins 


9.7e-123 


421.2 


1001 


RA 


Ras association (RalGDS/AF-6) 
domain 


1.2e-15 


65.4 


1004 

- 


- DUF81 


Domain of unknown function 
DUF81 


0.099 


10.2 


1005 


actin 


Actin 


1.3e-174 


574.3 


1006 


actin 


Actin 


3.1e-130 


428.6 


1007 


cpn60_TCPl 


TCP-l/cpn60 chaperonin family 


3 .7e-195 


661.8 | 


1008 


TPR 


TPR Domain 


8 . le-44 


159.0 


1009 


gf-C2H2 


Zinc finger, C2H2 type 


3 .6e-61 


216.6 


1011 


zt-C2H2 


Zinc finger, C2H2 type 


3.6e-61 


216.6 


1012 


Zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


4 .7e-15 


53 .1 


1016 


tRNA- eynt_2c 


tRNA synthetases class II (A) 


2 .3e-15 


55.2 


1018 


RhoGAP 


RhoGAP domain 


1 .6e-78 


274.3 


1022 


PGAM 


Phosphogiycerate mutase family 


3-8e-18 


69.7 


1026 


HMGJbox 


HMG~ (high mobility group) box 


8.4e-20 


79.2 j 


1027 


TBC 


TBC domain 


7.3e-45 


162.5 


1028 


0Q_con 


Ubiquit in- conjugating enzyme 


1.4e-49 


178.1 


1032 


PDZ 


PDZ domain (Also known as DHR 
or GLGF) . 


0 .028 


16 .3 


1034 


Hydrolase 


haloacid dehalogenase-iike 
hydrolase 


2e-21 


84 .6 


1037 


KRAB 


KRAB box 


4 - 8e-06 


■SA . 4 


1038 


Cation_efflu 

X 


Cation efflux family 


7-le-42 


152.5 


1040 


ART 


NAD:arginine ADP- 
ribosyltransf erase 


4-7e-47 


169.1 


1042 


WD40 


WD domain, G-beta repeat 


1.9e-l8 


74.7 


1043 


zf-C2H2 


Zinc finger, C2H2 type 


3.7e-24 


93 .7 


1045 


lectin_c 


Lectin C~type domain 


1.9e-28 


108.0 


1046 


Glue osamine_ 
iso 


Glucosamine-6 -phosphate 
isomerase 


0.00013 


-25.1 
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SEQ ID 
NOr 


PFAM NAME 


DESCRIPTION 


p-value 


PFAM 
SCORE 


1047 


ligage-CoA 


CoA-li gases 


4 .5e-80 


279 .4 


1049 




Immunoglobulin domain 


1.7e-09 


35.6 


1050 


Ribosomal__L2 
4e 


Ribosomal probein L24e 


2e-33 


124 .5 


1054 


Amidase 


Amidase 


4.3e-i52 


518 .7 


1055 


ITTO 


rna recognition motif. 


3 . 8e~26 


100 .3 


1058 


annexin 


Annexin 


6. 9e-44 


159 .2 


1059 


PMP22_Ciaudi 
n 


PMP-22 /EMP/MP2 O/Claudin family 


0.023 


-23 .6 


1060 


hpmeobox 


Homeobox domain 


3 , 2e-31 


117 .2 


1062 


Acyltransfer 
aae 


Acyl transfer ase 


0 .00065 


T7\ — F 

10 .5 


1064 


AMP-binding 


AMP -binding enayme 


6 ,6e-l00 


345.3 


1065 


LRR 


Leucine Rich Repeat 


3 .3e-14 


60 . 6 


1066 


GTPljDBG 


GTPi/OBG family 


4.8e-41 


141.8 


1071 


ig 


Immunoglobulin domain 


8.4e-48 


159.1 


1072 


PHD 


PHD- finger 


6.Be-07 


36.3 


1074 


DENN 


DENN (AJEX-3) domain 


8.3e-33 


121.5 


1075 


SCP 


SCP-like extracellular protein 


4.7e-41 


149.8 


1077 


OLP 


Olfactomedin-like domain 


2.2e-66 


234.0 


1078 


mi to carr 


Mitochondrial carrier proteins 


ie-42 


149.3 


1079 


WD4Q 


WD domain, G-beta repeat 


6 .2e-45 


162.7 


1087 


START 


START domain 


1.5e-48 


174.7 


1093 


DSPC 


Dual specificity phosphatase, 
catalytic doma 


3.3e-63 


223.4 


1094 


GSHPx 


Glutathione peroxidases 


9.6e-41 


148.8 


1095 


DUF25 


Domain of unknown function 
DUF25 


2e-75 


264.0 


1096 


.DUF25 


Domain of unknown function 
DUF25 


6e-^ 


262.4 


1105 


Nitroreducta 
se 


Nitroreductase family 


1.3e-l3 


58.6 


1106 


PTE 


Phosphodiesterase family 


1.3e-179 


610.1 


1107 


DAGKc 


Diacylglycerol kinase catalytic 
domain 


0.00049 


19.6 


1109 


raff 


Ras family 


1.3e-15 


40.7 


1115 


ArfGap 


Putative GTP-ase activating 
protein for Arf 


9.7e-47 


168.7 


1116 


HMG14_17 


HMG14 and HMG17 


4 .4e-2l 


83.5 


1117 


HMG14JL7 


HMG14 and HMG17 


9.9e-12 


52.4 


1119 


FAA_hydroIas 
e 


Fumarylacetoacetate (FAA) 
hydrolasB fam 


2e-83 


290.6 


1120 


pkinase 


Bukaryotic protein kinase 
domain 


1.4e-94 


327.6 


1123 


abhydrolase 


alpha/beta hydrolase fold 


9.2e-23 


89.0 


1129 


pro i some ras 
e 


Cyclophilin type pep tidy! - 
prolyl cis-tr 


2.2e-56 


197.1 


1131 


DnaJ 


DnaJ domain 


1.6e-30 


114.9 


1132 


WD40 


WD domain, G-beta repeat 


1.3e-19 


78. 6 


1133 


WD40 


WD domain, G-beta repeat 


1.8e-15 


64.9 


1134 


PH 


PH domain 


0.0015 


17.8 


1136. 


Adap comp su 
b 


Adaptor complexes medium 
subunit family 


l,2e-256 


866 . 0 


1137 


Adap_comp_su 
b 


Adaptor complexes medium 
subunit family 


2.5e-209 


708 .8 


1139 


ras 


Ras family 


1.5e-86 


301 .0 


1141 


pkinuoe 


Bukaryotic protein kinase 
domain 


9 . 4e- 74 


- * 


1152 


Acyl transfer 
ase 


Acyl t rans f era se 


1.2e-05 


29-9 


1153 


IRS 


PTB domain ( IRS- 1. type) j 


5.4e-55 


196.1 


1155 




Immunoglobulin domain 


1.3e-31 


106.9 


1157 


Asparaginase 
2 


Asparaginase 


6.4e-72 


252.3 


1159 


GMC_oxred 


GMC oxidoreductases 


4.7e-142 


485.3 


1160 


zf-ANl 


ANl-like Zinc finger 


0.00021 


27.9 
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SBQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p -value 


PFAM 

SCORE 


1X63 


linker_histo 
ne 


linker hi stone Hi and H5 family 


3 . 8e-14 


60 .4 


1164 


DED 


Death effector domain 


3 . 9e-0S 


30.5 


1165 


IRS 


PTB domain (IRS-1 type) 


2.6e-43 


157.3 


1166 


IRS 


PTB domain (IRS-1 type) 


2.6e-43 


157 .3 


116B 


SAM 

• 


SAM domain (Sterile alpha 
motif) 


0 . 04 


10 .5 


1170 


abhydrolase 


alpha/beta hydrolase fold 


O.098 


-7 .5 


1174 


SAP 


SAP domain 


3.9e-l0 


47 . 1 


1177 


PP2C 


Protein phosphatase 2C 


5.3e-31 


112 .5 


1178 


WD40 


WD domain, G-beta repeat 


4-7e-35 


129.9 


1180 


Ets 


Ets-domain 


1.8e-09 


33 .3 


1181 


Collagen 


Collagen triple helix repeat 
(20 copies) 


0. 00016 


24.7 


11B2 


TO>l_MTCPl 


TCia/MTCPl family 


9.5e~56 


198.6 


1184 


RasGEF 


RasGBF domain 


1.7e-88 


307.4 


1185 


mito^carr 


Mitochondrial carrier proteins 


1.5e-62 


217.3 


1187 


UPARLY6 


u-PAR/Ly-6 domain 


0.0042 


15.6 


1188 


Om_DAP_Axg_ 
dec 


Pyridoxal-dependent 
decarboxylase 


6.2e-128 


430.6 


1193 


Stathmin 


Stathmin family 


1.8e-90 


314.0 


1194 


Stathmin 


Stathmin family 


1.8e-90 


314.0 


1195 


Seel 


Seel family 


3.2 e -l83 


622.1 


1196 


pyrjredox 


Pyridine nucleotide -di sulphide 
oxidoreducta 


3.1e-32 


in:& 


1197 


Glyco_transf 


Glycosyl transferase family 8 


1.2e~09 


45.5 


1202 


K_tetra 


K+ channel tetramerisation 
domain 


0.022 


-16.8 


1203 


adhshort 


short chain dehydrogenase 


8.3e-45 


162.3 


1206 


Ubie methyl t 
ran 


ubiE/C0Q5 methyltransf erase 
family 


1.3e-l21 


417.4 


120B 


7tm_3 


7 transmembrane receptor 


7.2e-09 


29.0 | 


1209 


ank 


Ank repeat 


3.9e-l5 


63 .7 


1210 


VATP- 
synt_AC39 


ATP Synthase (C/AC39) subunit 


2.5e-128 


439.7 


1212 


zf-C2H2 


Zinc finger, C2H2 type 


5. 5e-17 


69.9 


1213 


efhand 


BF hand 


3.2e-07 


37.4 


1219 


rrra 


RNA recognition motif. 


2.1e-40 


147.7 


1220 


DUF6 


Integral membrane protein DUF6 


0.015 


21.5 


1222 


SCAN 


SCAM domain 


i.5e-71 


251.1 


1223 


G- gamma 


GGXi domain 


3.6e-36 


129.5 


1227 


catalase 


Catalase 


0 


1158 . 9 


1232 


PX 


PX domain 


2.2e-15 


64.5 


1233 


PX 


PX domain 


2.ie-15 


64.5 


1236 


FCH 


Fes/CIP4 homology domain 


3.3e-09 


44.0 


1241 


Peptidase M2 
0 


Peptidase family M20/M25/M40 


2e-63 


224 .1 


1243 


WW 


WW domain 


0.044 


17.9 


1247 


UPF0006 


Metalloenzyme of unknown 
function TTPF0006 


6.3e-61 


215.8 


1248 


Glycos_trans 
t_2 


Glycosyl transferases 


4.5e-10 


46.9 


1249 


efhand 


BF hand 


4e-ll 


50.4 


1254 


DQ_con 


Ubiqui tin- conjugating enzyme 


2.1e-73 


257.3 


1255 


raa 


Has family 


2.2e-62 


220 .7 


1256 


formyl trans 
f 


Formyl transferase 


4 . 9e-30 


108 .3 


1259 


Zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


5.3e-13 


46.4 


1261 


DiKfolate re 
d 


Dihydrofolate reductase 


2.1e-69 


241.7 


1262 


G_glu__transp 
ept 


Gamma-glutamyltranspept idase 


l.Se-110 


380.4 


1263 


PAS 


PAS domain 


1.3e-08 


36.9 


1265 


LRR 


Leucine Rich Repeat 


4.2e-22 


86.9 
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SEQ ID 
NO: 


"API vi*\Flr> 




p- value 


PFAM 


1266 


SCP 


SCP-like extracellular protein 


6e-29 


108.0 


3.267 




XT m nhnnnol t"pt"ramf>T*"i ctkhfrm 

domain 


2 - 8e-27 


104 . 0 


1269 


XdLS 


Daq f ami 1 V 


X . JC 03 


297 . 9 


127? 


2?-C3*HC4 


7inc f inner C3HC4 tvop fRTNf: 
finder) 


% • AC" JLU 


n n 


1276 


a b hydrolase 


»1 /bf»ta hvciifolase "Folti 


5 . 4e-23 


oj - a 


1277 


abhydrolase 


alpha/beta hydrolase fold 


5.6e-21 


83.1 


1279 


Li y^JS> 111 




4 .4e— 41 


132 . 0 


1280 


PBP 


Phosphat xdyle tlianolaniine - * 


1 . 3e~13 


58 . 7 


1285 




xtXil^- .LAJlyCJL, ^ JXH— i type IKJLLiV? 




49 .6 


1287 


ank 




— — — _ 

1 . 7e-b2 


5 

187 . 8 


1294 


£n3 


hrrtnori* { n t"xrr\e» TTT rffwnJi ■! *i 

jp xjji. yncttAii uypts in uvuicixii 


0 026 


20 . 9 


1295 


r>DT) 


Guanylate -binding protein 


U . (JUU^o 


-70 . 0 


1296 


PMP22_Claudi 
n 


PMP-22/EMP/MP2G/Claudin family 


6.9e-41 


149.3 




Rhodane s e 


Rhodaneee- Ixke domain 


3 - 2e-14 


60 . 7 


1298 


LIN 


LIM domain containing proteins 


5.8e-21 


79.1 




maseA 


Pancreatic ribonucleases 


4 . 9e-43 


145.2 


1307 


mito_carr 


Mitochondrial carrier proteins 


2.1e-53 


186.0 


1308 


WD40 


WD domain, G-beta repeat 


1.6e-17 


71.6 


13X0 


tXPAR LY6 


u-PAR/Ly-G domain 


7.1e-20 


75.5 


1313 


thiored 


Thioredoxin 


3 .6e-05 


21.6 


1314 


Aa trans 


Transmembrane amino acid 
transporter protein 


l.Se-67 


237.9 


1316 


trypsin 


Trypsin 


4 .4e~41 


132.0 


1320 


Ribosomal i»l 
3 


Ribooomal protein 1*13 


3 . 9e-62 


219.8 


1327 


Armadil 1 o_s e 

g 


Armadillo/beta-catenin-like 
repeats 


0 . 0054 


23.4 


1328 


KRAB 


KRAB box 


0.052 


-5.6 


1329 


ran 


RNA recognition motif. 


2.1e-40 


147.7 


1330 


Bcl-2 


Apoptosis regulator proteins, 
Bcl-2 family 


0.014 


-1.6 


1331 


PX 


PX domain 


2.1e-10 


48.0 


1333 


KRAB 


KRAB box 


1.8e-36 


134.6 


1334 


UPP_syntheta 
Be 


Putative undecaprenyl 
diphosphate synt 


2.3e-89 


310.3 


133 5 


UJr F_syncneta 
se 


Putative undecaprenyl 
diphosphate synt 


1.8e-59 


211.0 


1336 


JJoJf C 


Dual specificity phosphatase, 
catalytic doma 


1 , 2e-31 


118 .6 


1337 


DSPc 


Dual specif xcxfcy phosphatase, 

/-• a +- a "1 vrr* ■? r r)/>m,i 


2 . 3e-12 


54 . 5 


1338 


TPR 






28 1 


1340 


metal thio 


Metailothionein 


0 . 013 


20.3 


1341 


mutT 


Bacterial mutT protein 


5.8e-09 


36.5 


1343 


Band 41 


PERM rinmniTi f Ar*nH A 1 fami 1 \/ ^ 


x . je-jo 


122 . 5 


1344 


Kelch 


Kelch motif 


1.4e-44 


161.5 


1345 




TV f H fvAOTo nrYiHui n 

fTdi u j. a. jlcc pruucin 


l . 2e _ 10 


48.8 


1347 


3Beta HSD 


"3 _ V>£» f- 71 hvriTYivuaherni' rl 
dfthvdroapna ro- / i rtrwnf* 


0 086 


_ 1*7*7 *5 i 


1348 


BTB 


BTB/P0Z domain 


5 .3e-28 


106 . 5 


1349 


DUP6 


■*"twy*.ax uiClllui-cUiC ^*Alsl>sPAAl fVCD 


0 033 


1C Q 

AO . o J 


1350 


myosin__head 


Myosin head (motor domain) 


0 


1088.7 


1352 


Nrarap 


Natural resistance-associated 
macrophage pro 


1.2e-202 


686.6 


1353 


S 100 


S-100/ICaBP type calcium 
binding domain 


5.3e-23 


89.9 


1355 


DEAD 


DEAD/DEAH box helicase 


3.6e-65 


209.0 


1356 


C2 


C2 domain 


2.4e-15 


64.4 


1357 


RBD 


Raf-like Ras -binding domain 


4.2e-57 


203.1 


1360 


Zf-C2H2 


Zinc finger, C2H2 type 


7.4e-14l 


481.4 


1361 


HMG14_17 


HMG14 and HMG17 


7.9e-40 


145.7 
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SBQ ID 
NO: 


PF7VM NAME 




p '-value 


PFAM 


1362 


SIS 


SIS domain 


3 . 8e-30 


113 6 


1363 


SIS 


SIS domain 


1 . 3e-28 


inn r 

JLU 0 . D 


1364 


ig 


Immunog lobul in domain 






1366 


K_tetra 


K+ channel tetraroerisation 
domain 


1 . le-16 




1371 


Collagen 


Collagen triple helix repeat 
(20 copies) 


2 . 2e-113 


ion i 


1372 


DnaJ 


Dna*7 domain 


6 . 6e-36 




1376 


KRAB 


KRAB box 


2 le-38 


141 0 


1378 


ELM2 


EiM2 domain 


2e-23 


91 .3 


1386 


thiored 


Thioredoxin 


1 .2e-23 


82 8 


1381 


ank 


Ank repeat 


2 »3e-83 


t ?u . ** 


1382 


BTB 


BTB/P0Z domain 


3e-ll 


50 . 8 


1383 


WD40 


WD domain R-hp*t~;i y**»t-^p»»I- 


1 6e-19 


78 . 3 


1384 


WD40 




{> . 3e-24 


92 .9 


1387 


zf -C3HC4 


Zinr* f i napr Pivrnd f-vn^ /i3THr; 
" J -* lk -> i-iiyci , ^-jrif* type ikaivlj 

finger) 


1 . le-09 


35 . 6 


1389 


zf -C2H2 




K Co Cfi 

t> . >e 3t» 


179 . 5 


1390 


zf-C2H2 


Zinc finger, C2H2 type 


2.5e-85 


296.9 


1393 


kineoin 


jvincsxix motor aonoiii 


/ . 8e-188 


637 .4 


1394 




6j.hl. tuigei/ v^iu^ type 


l_2e-49 


178.4 


1398 


KRAB 


KRAB box 


5.1e-22 


86.6 


1402 


K7TD 


bZIP transcription factor 


0 .035 


13.1 


1405 


sugar fcr 


Sugar (and other) transporter 


0 .003 


-101 .5 


1406 




RhoGAP domain 


8.9e-47 


168.8 


1407 




RNA recognition motif. 


le-35 


132 .1 


1408' 


LRR 


Leucine Rich Repeat 


2 .ie-13 


58 .0 


1409 


Nebulin repe 
at ~~ 


Nebulin repeat 



6e-54 


192.6 


1410 




Ank repeat 


1.6e-17 


71.6 - 


1412 


riAJ-«J£>5JlllCl J. l-O 

JZ 


rrDosotnax iop ramiiy c-terminus 


8 _2e-58 


205.5 


1415 




Trypsin 


4 . 7e-85 


270 .4 


1416 


aminotranl 


Aminotransferases class- 1 


4.4e-05 


-91.2 


1417 


SI 


si kna binding domain 


1. 6e-C7 


33.1 


1419 


WD40 


WD domain, G-beta repeat 


2 . 2e-09 


44 .6 


1422 




Cadher in doroa in 


8.3e-42 


1S2.3 


1424 


SH3 


oil j uouidin 


2 . 5e-B0 


280.3 


1425 


PHD 


JrllU Cluy ejr 


3 . 2e-17 


70.6 


1426 


PHD 


f riu- 1 mg er 


3 .2e-l7 


70 .6 


1427 


ArfGap 


Putative GTP-ase activating 

piv»l.CJ.H JL UI Mi J_ 


le-3? 


138.8 


1428 


helicase C 


xicixcdocs conuervcu terminal 


le-26 


102.2 


1429 


WD40 


HI) dnim'tn d~l"u»hj» mMnaat* 




3 . 9e— 07 


37.2 


1430 


inositol P 


Inositol monoDhofinhs h » -Fam-i lv 


2 Se— 10 


A f\ *> 
*k V . Z 


1431 


raito carr 


Mitochondrial carrier proteins 


4 3e— 83 


AO/./ 


1433 


Clq 


Clq domain 


2 . 9e-16 


66 . 2 


1434 


WD40 


WD domain, G-beta repeat 


1 . 6e-13 




1435 


Inos-l- 
P_synth 


Myo - inos i t ol - 1 -phospha t e 
synthase 


7e-228 


770 . 4 " 


1436 


rrm 


RNA recognition motif. 


1.4e-34 


128 .3 


1438 


ig 


Immunoglobulin domain 


1.3e-12 


45.6 


1440 


G_Adapt_CT 


Gamma-adapt in, C- terminus 


3 .4e-67 


236 . 7 


1441 


G_Adapt_CT 


Gamma-adapt in , C- terminus 


3 . 4e-67 


236.7 


1443 


Kelch 


Kelch motif 


0.00013 


28.7 


1446 


ARID 


ARID DNA binding domain 


1 . 8e-2i 


84.7 


1447 


zt-C2H2 


Zinc finger, C2H2 type 


9.4e-28 


105.6 


1448 


AMP-binding 


AMP-binding enzyme 


2.6e-07 


-145.1 


1451 


rrm 


RNA recognition motif. 


6.5e-21 


82.9 


1454 


*g 


Immunoglobulin domain 


5.6e-44 


146.7 j 


1455 


Sialyl transf 


Sialyl transferase family 


5.4e-21 


83.2 


1460 


Aldoseepim 


Aldose l-epimerase 


1.9e-35 


131.2 


1461 


C2 


C2 domain 


4e-18 


73 .6 


1470 


TIG 


IPT/TIG domain 


3.1e-19 


77.3 


1472 


PseudoU_synt 


RNA pseudouridylate synthase 


4.3e-16 


66.9 
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SEQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p- value 


PFAM 
SCORE 




h_2 








14-7 4 


DENN 


DENN (AEX-3 ) domain 


1.3e-44 


161.6 


1475 


Cation_ef flu 

X 


Cation efflux family 


4.6e-49 


176.4 


1477 


TBC 


TBC domain 


Be-47 


169.0 


1478 


rrm 


RMA recognition motif. 


2e-21 


84.6 


1480 


ig 


Immunoglobulin domain 


5.5e-06 


24.3 


1484 


TeloJbind_al 
pha 


Telomere-binding protein alpha 
subuni 


0.028 


-225.9 


1485 


2f-C2H2 


Zinc finger, C2H2 type 


1.8e-68 


240.9 


1486 


pkinase 


Eukaryotic protein kinase 
domain 


9.5e-13 


49.9 


1488 


helicase_C 


Helicaaes conserved c- terminal 
domain 


1.4e-15 


65.2 


1489 


DUF89 


Protein of unknown function 
DUF89 


0.079 


-132.4 


1490 


ECH 


Enoyl-CoA hydratase/i3omera3e 
family 


5.2C-41 


149.7 


1491 


guanylate_cy 
c 


Adenylate and Guanylate cyclase 
catalyt 


5.9e-46 


166.1 


1492 


IiRR | 


leucine Rich Repeat 


3.4e-19 


77.2 


1495 


zf-C3HC4 


Zinc finger, C3HC4 type (RING 
finger) 


7.1e-10 


36.3 


1497 


pkinase 


Eukaryotic protein kinase 
domain 


le-22 


85.8 


1500 


SH3 ] 


SH3 domain 


9.3e-05 


27.2 


1502 


homeobooc 


Homeobox domain 


D.0B4 


13.8 


1503 


homeobox 


Homeobox domain 


0.084 


13.8 


1505 


EGF 


EGF -like domain 


2.7e-23 


90.8 


1506 


UCH-2 


Ubiguitin carboxyl -terminal 
hydrolase family 


2 .7e-21 


84.2 


1508 


Peptidase M2 
0 


Peptidase family M20/M25/M40 


2.8e-28 


101.8 


1511 


PX 


PX domain 


1.9e-ll 


51.5 


1512 


Sulf atase 


Sulfatase 


2.8e-35 


130.7 


1516 


Syntaxin 


Syntaxin 


0.011 


-62.3 


1518 


aminotran_3 


Aminotransferases class- III 
pyridoxal-pho 


9.7e-106 


305-6 


1520 




Immunoglobulin domain 


0.075 


11.0 


1521 


RA 


Ras association (RalGDS/AF-6) 
domain 


o ; 013 


13.3 


1523 


RhoGAP 


RhoGAP domain 


2.5e~05 


18.7 


1528 


WD40 


WD domain, G-beta repeat 


5.4e-24 


93.1 


1535 


IMS 


impB/mucB/samB family 


7,8e-95 


328.5 


1538 


FYVE 


FYVE zinc finger 


3.2e-27 


101. S 


1539 


DAGKc 


Diacylglycerol kinase catalytic 
domain 


6e-07 


36.5 


1540 


Ocular alb 


Ocular albinism type 1 protein 


0 


1184.7 


1653 


SAP 


SAP domain 


6e-06 


33 .2 


1654 


Amino_oxirias 
e 


Flavin containing amine oxidase 


3.2e-43 


157.0 


1655 


Amino_oxidas 
e 


Plavin containing amine oxidase 


3.2e-43 


157.0 


1656 


RhoGEF 


RhoGEF domain 


1.4e-24 


95.1 


1657 


MMR_HSR1 


GTPase of unknown function 


0.0011 


-45.5 


1659 


UCH^2 


Ubiquitin carboxyl -terminal 
hydrolase family 


2.5e-ll 


51.1 


1660 


act in 


Act in 


6.6e-21 


69.9 


1661 


BAH 


BAH domain 


1.7e-82 


287.5 


1662 


vwa 


von Willebrand factor type A 
domain 


0 


1909.4 


1663 


WD40 


WD domain, G-beta repeat 


1.4e-67 


237.9 


1667 


zf-C2H2 


zinc finger, C2H2 type 


i.3e-93 


324 .4 


1669 


NollJfop2_Su 
n 


N0Ll/N0P2/snn family 


1.3e-23 


84.3 


1671 


SH2 


Src homology domain 2 


5.4e~l5 


46.9 
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SEQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p-value 


PFAM 
SCORE 


1672 


chromo 


'chromo 1 (CHRromatin 
Organization Modifier) 


2.le-i8 


67.7 


1674 


zf-CCCH 


Zinc finger C-x8-Ox5-C-x3-H 
type 


0.0025 


17.6 


1676 


Glyco_hydro 
47 


Glycoayl hydrolase family 47 


1.8e-187 


636.2 


1677 


Glyco_hydro 
47 


Glycosyl hydrolase family 47 


4.5e-74 


259.5 


1680 


WD40 


WD domain, G-beta repeat 


l.le-27 


105.5 


1681 


WD40 


WD domain, G-beta repeat 


l.le-27 


105.5 


1683 


MMRHSR1 


GTPase of unknown function 


1.8e-78 


274.1 


1621 


rrm 


RNA recognition motif . 


1.8e-37 


137,9 


1692 


rrra 


RNA recognition motif. 


1.8e-37 


137.9 


1693 


AAA 


ATPases associated with various 
cell ul ax- act 


1.3e-81 


284.5 


1697 


Ferric_jceduc 
t 


Ferric reductase like 
transmembrane com 


8.4e-82 


285.2 


1698 


Ferric reduc 
t 


Ferric reductase like 
transmembrane com 


3.5e-53 


190.1 


1693 


zf-C2H2 


Zinc finger, C2H2 type 


4.4e-34 


126.6 


1700 


arf 


ADP-ribosylation factor family 


9e~l9 


75.8 


1702 


GXP KFTU 


Elongation factor Tu family 


0.014 


11.4 


1703 


SCAN 


SCAN domain 


1.8e-54 


194.4 


1707 


pkinase 


Eukaryotic protein kinase 
domain 


1 .2e-88 


307.9 


1709 


WD40 


WD domain, G-beta repeat 


0.0035 


24 .0 


1710 


LRR 


Iieucine Rich Repeat 


1.2e-30 


115.3 


1711 


WW 


WW domain 


7.6e-12 


52.8 


1712 


ank 


Ank repeat 


4 .2e-34 


126. 7 


1713 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H 
type 


2.6e-09 


38.3 


1714 


zf~CCCH 


Zinc finger C-X8-C-X5-C-X3-H 1 
type 


2.6e-09 


38.3 


1715 


ras 


Ras family 


4 ,4e-41 


149. 9 


1718 


HMG_box 


HMG (high mobility group) box 


B.3e-21 


82 .6 


1719 


TBC 


TBC domain 


l .le-45 


165.2 


1721 


KLH 


Helix- loop-helix DMA- binding 
domain 


9.2e-10 


45.9 


1723 


darm 


Double- stranded RNA binding 
motif 


2.9e-05 


30.9 


1724 


RrnaAD 


Ribosomal RNA adenine 
dimethyl ases 


0.045 


9.2 


1725 


CIDE-N 


CIDK-N domain 


S.9e-40 


145.2 


1726 


HAT 


HAT (Half -A-TPR) repeats 


2.9e-44 


160.5 


1728 


efhand 


EF hand 


5.1e-20 


79.9 


1733 


Hist deacety 
1 


Hi stone deacety lane family 


1.7&-104 


360.6 


1735 


LRR 


Leucine Rich Repeat 


4.6e-34 


126.. 6 


1739 


pi-pu:-x 


Phosphat x dylinosit ol- specif ic 
phpspholipase 


0.0023 


16.1 


1743 


ras 


Ras family 


3.7e-lD 


-21.3 


1744 


ras 


Ras family 


3.7e-10 


-21.3 


1745 


RaeGEF 


RasGEF domain 


3.2e-49 j 


176.9 


1746 


adhshort 


short chain dehydrogenase 


7.1e-08 


34.6 j 


1751 


ssf-C2H2 


Zinc finger, C2H2 type 


9e-39 


142.2 


1754 


£n3 


Fibronectin type III domain 


5.5e-101 


348.9 


1756 


Z1-C2II2 


Zinc finger, C2H2 type 


6.3e-93 


322.1 


1758 


rrra 


RNA recognition motif. 


0.017 


21.2 j 


1760 


Nop 


Putative snoRNA binding domain 


6.le-95 


328.8 j 


1761 


Nop 


Putative snoRNA binding domain 


6.1e-95 


328.8 


1765 


MMR_HSR1 


GTPase of unknown function 


6.4e-41 


149.4 ] 


1769 


CN hy dr ol as e 


Carbon- nitrogen hydrolase 


3e-06 


-43.9 


1775 


ank 


Ank repeat 


4.1e-07 


37.1 


1779 


OxysterolJBP 


Oxysterol -binding protein 


4.7e-56 


199.6 


1783 


RhoGEF 


RhoGEF domain 


1.6e-23 


91.6 


1784 


RhoGBF 


RhoGEF domain 


1.6e-23 


91.6 
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SEQ ID 
NO: 


PFAM NAME 


DESCRIPTION 


p -value 


PFAM _ ! 
SCORE 


1785 


rrm 


RNA recognition motif. 


6.4e-14 


59.7 



TRADOCS:1416227.I(%CRN01!.DOC) 
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TABLES 



SEQ ID NO: 


POSITION OF 


MaxS (MAXIMUM 


Means (MB AN 




SIGNAL IN AMINO 


SCORE) 


SCORE) 




ACID SEQUENCE 






1 


1-21 


0.991 


0.955 


2 


1-31 


0.995 


0.944 


3 


1-33 


0.949 


0.736 


4 


1-19 


0.970 


0.951 


5 


1-26 


0.971 


0.863 


6 


1-26 


0.971 


0-863 


7 


1-26 


0.971 


0.863 


8 


1-26 


0.971 


0.B63 


9 


1-46 


0.982 


0.901 


10 


1-21 


0.991 


0.955 


11 


1-23 


0.989 


0.899 


12 


1-25 


0.955 


0.803 


13 


1-18 


0.932 


0.625 


14 


1-18 


0.938 


0.876 


15 


1-25 


0.941 


0.811 


16 


1-17 


0.972 


0.939 


17 


1-27 


0.964 


0.777 


18 


1-16 


0.914 


0.657 


19 


1-19 


0.953 


0.840 


20 


1-20 


0.935 


0.701 


21 


1-22 


0.974 


0.850 


22 


1-33 


0.961 


0.895 


23 


1-19 


0.991 


0.959 


24 


1-31 


0.995 


0,944 


25 


1-22 


0.976 


0.935 


26 


1-27 


0.996 


0.928 


27 


1-24 


0.953 


0.739 


28 


1-21 


0.906 


0.688 


29 


1-31 


0.986 


0.84i 


30 


1-28 


0.9 BO 


0.893 


31 


1-19 


0.993 


0.976 


32 


1-22 


0.998 


0.909 


35 


1-33 


0.949 


0-73^ 


%t 


1-33 


0.949 


0.736 


46 


1-19 


0.570 


0.951 


67 


1-25 


0.968 


0.848 


71 


1-18 


0.949 


0.845 


72 


1-30 


0.991 


0.919 


75 


1-29 


0.958 


0.854 


88 


1-20 


0.986 


0.94S 


94 


1-33 


0.994 


0.943 


97 


1-46 


0.964 


0.595 


103 


1-49 


0.983 


0.570 


108 


1-26 


0.978 


0.885 


111 


1-23 


0.9B9 


0.899 


126 


1-25 


0.955 


0.803 


129 


1-19 


0.963 


0.918 


138 


1-29 


0.971 


0.844 


143 


1-18 


0.914 


0.628 


148 


1-20 


0.969 


0.904 


156 


1-25 


0.941 


0.811 


158 


1-22 


0.979 


0.927 


160 


1-17 


0.972 


0.939 


161 


1-48 


0.903 


0.571 


162 


1-25 


0.937 


0.729 


168 


1-16 


0.939 


0.826 


171 


1-27 


0.964 


0.777 


17B 


1-21 


0.945 


0-825 


180 


1-27 


0.981 


0.941 


187 


1-28 


0.982 


0.936 


190 


119 


0.953 


0.840 


196 


1-22 


0.975 


0.916 


197 


1-22 


0 . 963 


0.936 
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SEQ ID NO: 


POSITION OF 


MaxS (MAXIMUM 


MeanS (MEAN 




SIGNAL IN AMINO 


SCORE) 


SCORE) 




ACID SEQUENCE 






199 


1-20 


0,935 


0.701 


200 


1-23 


0.977 


0.773 


206 


1-30 


0,984 


0.890 


207 


1-19 


0 . 990 


0.924 


208 


1-22 


0.974 


0.850 


210 


1-40 


0,940 


0.670 


211 


1-28 


0.971 


0.849 


216 


1-24 


0.986 


0.956 


218 


1-33 


0.961 


0.895 


219 


1-19 


0.970 


0.871 


221 { 


1-19 


0,904 


0.553 


222 


1-21 


0.917 


0.555 


230 


1-19 


0.991 


0.959 


231 


1-26 


0.953 


0.800 


232 


1-25 


0.988 


0.826 


239 


1-23 


0.969 


0.828 


240 


1-17 


0.982 


0.955 


241 


1-17 


0.982 


0.955 


245 


1-30 


0.970 


0.722 


248 


1-22 


0-976 ~ 


0.935 


249 


1-23 


0.968 


0.940 


252 


1-18 


0.971 


0.923 


261 


1-24 


0.883 


0 .587 


265 


1-Z.8 


0.939 


0.868 


272 


1-24 


0.953 


0 .739 


283 


1-21 


0.906 


0.688 


284 


1-29 


0.997 


0.854 


290 


1-31 


0.986 


0.841 


302 


1-28 


0.980 


0.893 


304 


1-16 


0.907 


0 .635 


312 


1-19 


0.993 


0.976 


313 


1-17 


0.930 


0.753 


323 


1-22 


0.998 


0.909 


324 


1-17 


0.982 


0.954 


328 


1-19 


0.971 


0.865 


329 


1-22 


0.963 


0.924 


330 


1-33 


0.978 


0.841 


331 


1-24 


0.920 


0.712 


332 


1-54 


0.975 


0.881 


333 


1-19 


0.S84 


0.941 


334 


1-20 


0.899 


0.567 


335 


1-27 


0.942 


0.813 


336 


1-20 


0.952 


0.850 


337 


1-38 


0.942 


0.653 


338 


1-27 


0.973 


0.772 


339 


1-36 


0.979 


0.804 


340 


1-27 


0.888 


0.597 


343 


1-19 


0.971 


0.865 


344 


1-22 


0.994 


0.928 


345 


1-17 


0.966 


0.687 


346 


1-19 


0.936 


0.822 


347 


1-22 


0.963 


0.924 [ 


349 


1-24 


0.982 


0.966 


351 | 


1-21 


0.918 


0.815 


352 


1-31 


0.988 


0.912 


354 


1-31 


0.974 


0.839 


355 


1-29 


0.932 


0.632 I 


356 


1-15 


0.994 


0.969 


357 


1-33 


0.935 


0.726 


360 


1-27 


0.938 


0.827 


361 


1-25 


0.954 


0.674 


362 


1-22 


0.929 


0.788 


363 


1-21 


0.881 


0.715 


364 


1-33 


0.978 


0.841 


365 


1-33 


0.978 


0.841 
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I 



SEQ ID NO: 


POSITION OF 


MaxS (MAXIMUM 


MeanS (MEAN 




SIGNAL IN AMINO 


SC0R3) 


SCORE) 




ACID SEQUENCE 






366 


1-21 


0.916 


0.820 


367 


1-19 


0.936 


0.822 


368 


1-29 


0.972 


0.874 


370 


1-24 


0.920 


0.712 


371 


1-24 


0.961 


0.773 


372 


1-27 


0.919 


0.768 


373 


1-19 


0.986 


0.945 


375 


1-32 


0.994 


0.932 


376 


1-34 


0.987 


0.810 


377 


1-17 


0.995 


0.950 


378 


1-49 


0.971 


0.749 


380 


1-20 


0.968 


0.874 


381 


1-20 


0.928 


0.782 


382 


1-19 


0.986 


0.934 


383 


1-28 


0.965 


0.829 


384 


1-39 


0.970 


0.551 


386 


1-24 


0.975 


0.881 


388 


1-30 


0.989 


0.868 


389 


1-19 


0.984 


0.941 


390 


1-26 


0.971 


0.782 


392 


1-20 


0.981 


0.900 


393 


1-16 


0.968 


0.890 


394 


1-23 


0.937 


0.701 


397 


1-22 


0.985 


0.854 


399 


1-46 


0.977 


0.698 


401 


1-20 


0.899 


0.567 


402 


1-22 


0.967 


0.931 


403 


1-27 


0.992 


0.934 


404 


1-19 


0.991 


0.973 


405 


1-23 


0.994 


0.921 


407 


1-35 


0.987 


0.658 


408 


1-39 


0.976 


0.551 


409 


1-33 


0.897 


0.570 


410 


1-25 


0.990 


0.962 


411 


1-38 


0.977 


0.827 


412 


1-20 


0-944 


0.768 


413 


1-20 


0.988 


0.965 


414 


1-46 


0.993 


0.638 


415 


1-23 


0.981 


0.940 


417 


1-29 


0.941 


0.672 


418 


1-20 


0.952 


0.8S0 


419 


1-19 


0.986 


0.967 


420 


1-29 


0.965 


0.861 


421 


1-22 


0.889 


0.785 


422 


1-48 


0.982 


0.B62 


424 


1-19 


0-979 


0.933 


428 


1-38 


0.942 


0.653 


430 


1-18 


0.947 


0.595 


432 


1-33 


0.957 


0.789 


433 


1-26 


0.979 


0.904 


434 


1-27 


0.962 


0-777 


435 


1-24 


0.998 


0.977 


436 


1-27 


0.973 


0.772 


443 


1-15 


0.966 


0.940 


448 


1-36 


0.979 


0.804 


453 


1-41 


0.958 


0.609 


455 


1-33 


0.943 


0.606 


457 


1-27 


0.868 


0.597 


462 


1-16 


0.925 


0.681 


486 


1-27 


0.972 


0.845 


495 


1-24 


0.917 


0.636 


498 


1-26 


0.993 


0.890 


505 


1-20 


0.976 


0.926 


507 


1-17 


0.966 


0.687 


510 


1-23 


0.930 


0.593 
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SEQ ID MO: 


POSITION OF 


MaxS (MAXIMUM 


Means (MEAN 




SIGNAL IN AMINO 


SCORE) 


SCORE) 




ACID SEQUENCE 






511 


1-23 


O.930 


0.593 


512 


1-23 


0.930 


0.593 


515 


1-18 


0.978 


0.956 


523 


1-19 


0.936 


0.822 


529 


1-22 


0.963 


0.924 


54S 


1-24 


0.982 


0.966 


55a 


1-30 


0.933 


0.713 


552 


1-21 


0.973 


0.912 


554 


1-23 


0.969 


0.7B4 


571 


1-21 


Q.91B 


0.815 


574 


1-31 


0.988 


0.912 


580 


1-39 


0.S25 


0.556 


594 


1-31 


0.974 


0.839 


608 


1-29 


0.932 


0.632 


609 


1-29 


0.S32 


0.632 


610 


1-21 


0.990 


0.948 


621 


1-15 


0.994 


0.969 


623 


1-33 


0.935 


0.726 


653 


1-27 


0.938 


0.827 


668 


1-22 


0.929 


0.788 


677 


1-16 


0.948 


0.807 


685 


1-21 


0.8B1 


0.715 


699 


1-22 


0 . 975 


0.816 | 


702 


1-31 


0.968 


0.898 


707 


1-16 


0.860 


0.562 


713 


1-25 


0.966 


0.743 


718 


1-19 


0,936 


0.822 


719 


1-20 


0.961 


0.824 


729 


1-29 


0.972 


0.874 


735 


1-46 


0.903 


0.598 


746 


1-14 


0.916 


0.730 


747 


1-22 


0.965" 


0 . 876 


748 


1-29 


0.968 


0.785 


759 


1-24 


0.961 


0.773 


767 


1-27 


0.919 


0.768 


768 


1-33 


0.900 


0.585 


773 


1-42 


0.959 


0.702 


779 


1-19 


0.986 


0.945 


797 


1-19 


0.944 


0.759 


798 


1-19 


0 . 900 


0.568 


820 


1-17 


0.995 


0.950 


827 


1-49 


0.971 


0.749 


848 


1-20 


0.968 


0.874 


864 


1-20 


0.928 


0.782 


866 


1-19 


0.986 


0.934 


873 


1-23 


0.948 


0.686 


881 


1-28 


0.965 


0.829 


887 


1-39 


0.970 


0.551 ! 


927 


1-30 


0.98S 


0.868 


934 


1-48 


0.988 


0 .777 


939 


1-39 


0.994 


0.889 


944 


1-26 


6.971 


0 . 782 | 


950 


1-29 


0.957 


6.845 


963 


1-20 


0.981 


0.900 


964 


1-20 


0.B86 


0.558 


973 


1-16 


0.96B 


0.890 


980 


— - — -■— - — 

1-34 


0 . 961 


0.749 


981 


1-20 


0.953 


0.822 


984 


1-12 


0.938 


0.780 


1015 


1-22 


0 . 985 


0.854 


1040 


1-46 


0.977 


0.698 


1052 


1-18 


0.969 


0.842 


1059 


1-20 


0.927 


0.867 


1065 


1-33 


0.983 


0.918 


1069 


1-22 


0.993 


0.935 
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SBQ ID WO: 


POSITION Ot" 


MaxS (MAXIMUM 


MeanS (MEAN 




SIGNAL 1H AMINO 


SCORE) 


SCORE) 




ACID SEQUENCE 






1075 


1-27 


0 .992 


0 . 934 


1080 


1-19 


0-931 


0 . 829 


1092 


1-19 


0 .991 


0 . 973 


1094 


1-46 


0 .992 


0 . 653 


1095 


1-30 


0.974 


0 . 929 


1105 


1-23 


0 -994 


0 .921 


1123 


— Tc ' 

1-35 


0.987 


0 . 658 


1138 


1-32 


0 .954 


0 .613 


1140 


1-33 


0.989 


0.789 


1142 


1-33 


0.897 


0.570 


1152 


1-25 


0 .990 


0.962 


1170 


1-38 


0.977 


0.827 


1176 


1-20 


0 .944 


0. 768 


1187 


1-20 


0.988 


0.965 


1189 


1-35 


0.967 


0.839 


1192 


1-46 


0.993 


0.638 


1193 


1-16 


0.925 


0.710 


1197 


1-29 


0.985 


0.853 


1208 


1-23 


0.981 


0.940 


1225 


1-29 


0.941 


0.672 


1245 


1-19 


0.986 


0.967 


1258 


1-29 


0.965 


o.66i 


1265 


1-22 


0.889 


0.785 


1266 


1-20 


0.944 


0.809 


1276 


1-48 


0.982 


0.862 


1292 


1-19 


0.979 


0.933 


1296 . 


1-21 


0-984 


0.944 


1297 


1-19 


0.984 


0.953 


1332 


1-38 


0 . 94 2 


0.653 


1358 


1-18 


0.947 


0.595 


1371 


1-33 


0.957 


0.789 


1380 


1-26 


0.979 n 


0.904 


1397 


1-27 


0.962 


0.777 


1399 


1-23 


0.997 


0.960 


1404 


1-24 


0.998 


0.977 


1410 


1-15 


0.946 


0.845 


1414 


1-24 


0,913 


0.588 


1415 


1-19 


0,982 


0.929 


1416 


1-12 


0.931 


0.891 


1418 


1-30 


0.933 


0.563 


1420 


1-20 


0.881 


0.561 


1421 


1-19 


0.990 


0.968 


1423 


1-17 


0 .968 


0.863 


1424 


1-21 


0.885 


0 .591 


1425 


1-24 


0.913 


0.588 


1426 


1-24 


0 .913 


0.588 


1428 


1-25 


0.967 [ 


0 . 899 


1430 


1-34 


0 . 977 


0 . 819 


1431 


1-28 


0 . 979 


0 .923 


1432 


1-36 


0 . 957 


0 .613 


1433 


1-32 


0 . 921 


0 . 753 


1434 


1-39 


0 . 983 


0 . 621 


1435 


1-25 


0 . 910 


0 . 631 






0 . 988 


0 . 868 


i A'i' l i 


I'M 


0 . 998 


0 . 980 


1442 


1-20 


0 . 918 


0 . 753 


1448 


1-12 


0.931 


0.891 


1462 


1-18 


0.968 


0.888 


1490 


1-20 


0.881 


0.561 


1518 


1-17 


0.968 


0.863 


1525 


1-21 


0.885 


0.591 


1547 


1-2B 


0.974 


0.891 


1561 


1-25 


0.967 


0.899 


1580 


1-17 


0.923 


0.824 


1593 


1-2B 


0.979 


0.923 
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SBQ ID NO: 


POSITION OP 
SIGNAL IN AMINO 
ACID SEQUENCE 


MaxS (MAXIMUM 
SCORE) 


MeanS (MEAN 
SCORE) 


1596 


1-16 


0.929 


0.709 


1601 


1-36 


0.957 


0.613 


1606 


1-22 


0.979 


0.831 


1607 


1-20 


0.974 


O.770 


1608 


1-32 


0.921 


0.753 


1614 


1-33 


0.969 


0.B29 


1616 


1-20 


0.959 


0.869 


1625 


1-39 


0.983 


0.621 


1632 


1-25 


0.910 


0.631 


1636 


1-33 


0-897 


0.591 


1639 


1-42 


0.988 


0.868 


1645 


1-20 


0.927 


0.568 


1647 


1-17 


0.923 


0.742 


1648 


1-22 


0.998 


0.980 



TRADOCS: 1 4 1 6234 J (%CR%0l ! -DOC) 
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TABLE 6 



SEQ ID NO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


WO: of 


of contig 


NO: 


docket number^ 


NO: in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 
sequence 




sequence 


priority 
application 




1 


1787 


3573 


5359 


784CIP2_1 


1103 


2 


1788 


3574 


5360 


784CIP2_2 


2673 


3 


1789 


357,5 


5361 


784CIP2_3 


4117 


4 


1790 


3576 


5362 


784CIP2_4 


5556 


5 


1791 


3577 


5363 


784CIP2 5 


5562 


6 


1792 


3578 


5364 


784CIP2_6 


5562 


1 7 


1793 


3579 


5365 


784CIP2J7 


5562 


8 


1794 


3580 


5366 


784CIP2 8 


5562 


9 


1795 


3581 


5367 


784CIP29 


5563 


10 


1796 


3582 


5368 


784CIP2JL0 


5564 


11 


1797 


3583 


5369 


784CIP211 


5565 


12 " 


1798 


3584 


5370 


784CIP2JL2 


5689 


13 


1799 


3585 


5371 


784CIP2JL3 


5729 


14 


1800 


3586 


5372 


784CIP2JL4 


5745 


IS 


1801 


3587 


5373 


784CIP2JL5 


5777 


16 


1802 


3588 


5374 


784CIP2_16 


5777 


17 


1803 


3589 


5375 


784CIP2 17 


5789 


16 


1804 


3590 


5376 


784CIP2JL8 


5792 


19 


1805 


3591 


5377 


784CIP2_19 


5804 


20 


1806" 


3592 


5378 


784CIP2_20 


5805 


21 


1807 


3593 


5379 


784CIP2 21 


580S 


22 


1808 


3594 


5380 


784CIP2 22 


\ 5844 


23 


1809 


3595 


5381 


784CIP2_23 


5844 


24 


1810 


3596 


5382 


784CIP2_24 


5850 


25 


1811 


3597 


5383 


784CIP2 25 


5867 


26 


1812 


3598 


5384 


784CIP2 26 


5973 


27 


1813 


3599 


5385 


784CIP2_27 


5995 


28 


1814 


3600 


5386 


784CIP2_28 


5995 


29 


1815 


3601 


5387 


784CIP2_29 


6005 


30 


1816 


3602 


538 B 


784CIP2_30 


6007 


31 


1817" 


3603 


5389 


784CIP2_31 


6007 


32 


1818 


3604 


5390 


784CIP2_32 


6009 


33 


1819 


3605 


53S1 


784CIP2_33 


6012 


34 


1820 


3606 


5392 


784CIP234 


6015 


35 


1821 


3607 


5393 


7B4CIP2_35 


6016 


36 


1822 


3608 


5394 


784CIP2_36 


6016 


37 


1823 


3609 


5395 


7B4CIP2_37 


6018 


38 


1824 


3610 


5396 


784CIP238 


6018 


39 


182S 


3611 


5397 


784CIP239 


6018 


40 


1826 


3612 


5398 


784CIP2 40 


6023 


41 


1827 


3613 


5399 


784CIP2 41 


6070 


42 


1828 


3614 


5400 


784CIP2_42 


6081 


43 


1829 


3615 


5401 


784CIP2_43 


6089 


44 


1830 


3616 


5402 


784CIP2 44 


6118 


45 


1831 


3617 


5403 


784CIP245 


6118 


46 


1832 


3618 


5404 


784CIP2_46 


6130 


47 


1833 


3619 


5405 


784CIP247 


6177 


4B 


1834 


3620 


5406 


784CIP2_48 


6189 


49 


1835 


3621 


5407 


784CIP249 


6191 


50 


1836 


3622 


5408 


784CIP2 50 


6204 


51 


1837 


3623 


5409 


784CIP2 51 - 


6204 


52 


1838 


3624 


5410 


784CIP2_52 


6284 


53 


1839 


362S 


5411 


784CIP2 53 


6367 


54 


1840 


3626 


5412 


784CIP2 54 


6436 


55 


1841 


3627 


5413 


784CIP2_55 


6442 


56 


1842 


3628 


5414 


784CIP2_56 


6445 


SI 


1843 


3629 


5415 


784CIP2_57 


6457 


58 


1844 


3630 


5416 


784CIP2_58 


6458 


59 


1845 


3631 


5417 


784CIP2_59 


64S8 
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SEO ID NO* 

of full- 
length 
nucleotide 
sequence 


SKQ ID 
NO; of 
full- 
length 
peptide 
sequence 


>z>i '\J ± V msji 

Ul l-» LSI* Liy 

nucleotide 
sequence 


oty xu 
NO: 

of contiq 

peptide 

sequence 


Priority 

r)r>r*\c t~ m iml-vn- 
uvLAt: l nuiJlDer 

SEQ ID NO- in 

priority 

application 


SEQ ID 
NO: in 


60 


1846 


3632 


5418 


784CIP2_60 


6462 


61 


1847 


3633 


5419 


784CIP2 61 


6472 


62 


1846 


3634 


5420 


i 784CIP2_62 


6499 


63 


1849 


3635 


5421 


784CIP2 63 


6499 


64 


1850 


3636 


5422 


784CIP2 64 


6505 


65 


1851 


3637 


5423 


784CIP2 65 


6534 


66 


1852 


3638 


5424 


784CIP2 66 


6534 


67 


1853 


3639 


5425 


784CIP2 67 


6540 


6B 


1854 


3640 


5426 


784CIP2 68 


6550 


69 


1855 


3641 


5427 


784CIP2 69 


6550 


70 


1856 


3642 


5428 


784CIP2 70 


6592 


71 


1857 


3643 


5429 


7B4CIP2 71 


6645 


72 


1858 


3644 


5430 


784CIP2 7? 


6671 


73 


1859 


3645 


5431 




6763 


74 


1860 


3646 


5432 




6763 


75 


1861 


3647 


5433 




6786 


76 


1862 


3648 


5434 




coo* 


77 


1863 


3649 


5435 


'7Riir*TP'? *7"7 


coon 
0 0 U 


78 


1864 






TRAPTD") 7Q 


boji. 


79 


1865 


3651 


5437 


f 0*k*^W £ /y 


6832 


BO 


1866 






TBAfTDO on 
/U4LlrZ cfU 


6834 


81 


1867 








6834 


82 


1658 


3654 


544 0 


r Of* \-X mt4 JO Z 




83 


1869 


3655 


5441 


r O * » v . 1 tr £. 0 J 


683^ 


84 


1870 


3656 






/•Oil 


'" ' 85 


1871 


309 / 






6859 


86 


1872 


3658 




/O^LlfZ DP 


6915 


87 


1873 




cad r 


/O^Lil'ii t> / 


con 


88 


1874 








D» / 


89 


1875 


1 3661 


5447 






90 


1876 


3662 


"'5448" 




t>J / J 


91 


1877 


3663 


5449 




6973 


92 


1878 


3664 


5450 




fUV / 


93 


1879 


3665 


5451 




7018 " " " 


94 


1880 


3666 


5452 




*7fl7 Q 


95 


1881 i 


3667 


5453 




7020 


96 


1882 


3668 


5454 


784CIP2 97 


7020 " r- "' 


j 97 


1883 


3669 


5455 


784CIP2 98 


7021 


98 


1884 


3670 


5456 


784CIP2 99 


7023 


99 


1885 


3671 


5457 


7B4CIP2 100 


7027 


100 


1886 


3672 


5458 


784CIP2 101 


7028 


101 


1887 


3673 


5459 


784CIP2 102 


7029 


102 


1888 


3674 


5460 


764CIP2 103 


7031 


103 


1889 


3675 


5461 


7B4CIP2 104 


7032 


104 


1890 


3676 


5462 


784CIP2 105 


7033 


105 


1891 


3677 


5463 


764CIP2106 


7035 


106 


1892 


3678 


5464 


784CIP2 107 


7036 


107 


1893 


3679 


5465 


784CIP2_108 


7039 


108 


1894 


3680 


5466 


784CIP2_109 


7043 


109 


1895 


3681 


5467 


784CIP2_jL10 


7044 


110 


1894 


3682 


54 GB 


784CIP2111 


7046 


111 


1897 


3683 


5469 | 


784CIP2_112 


7054 


112 


1898 


3684 


5470 


784CIP2_113 


7061 


113 


1899 


3685 


5471 


784CIP2_114 


7677 


114 


1900 


3686 


5472 


7B4CIP2_115 


7092 


115 


1901 


3687 


5473 


784CIP2_116 


7094 


116 


1902 


3686 


5474 


784CIP2 117 


7106 


117 


1903 


3689 


5475 


7B4CIP2_118 


7107 


118 


1904 


3690 


5476 


784CIP2 119 


7111 


119 


1905 


3691 


5477 


784CIP2_120 


7123 


120 


1906 


3692 


5478 


784CIP2 121 


7142 


! 121 


1907 


3693 


5479 


784CIP2 122 


7142 
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SEQ ID NO: 


SEQ ID 




SEQ ID 


Priori ty 


SEQ ID 


of full- 


NO: of 


of contig 


MO; 




NO • in 


length 


full- 


nucleotide 


of con tig 


corresponding 


17. S.S.N, 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO; in 


09/4B8, 725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




122 


1908 


3694 


5480 


784CIP2123 


7154 


123 


1909 


3695 


5481 


784CIP2124 


7160 


124 


1910 


3696 


5482 


784CIP2_125 


7169 


125 


1911 


3697 


5483 


784CIP2JL26 


7185 


126 


1912 


369B 


5484 


784CIP2 127 


7197 


127 


1913 


3699 


5485 


704CIP2128 


7219 


128 


1914 


3700 


5486 


784CIP2_129 


7226 


129 


1915 


3701 


5487 


784CIP2130 


7229 


130 


1916 


3702 


5488 


7B4CIP2 131 


7234 


131 


1917 


3703 


5489 


784CIP2 132 


7235 


132 


1918 


3704 


5490 


784CIP2 133 


7235 


133 


1919 


3705 


5491 


784CIP2 134 


7238 


134 


1920 


3706 


5492 


784CIP2 135 


7247 


135 


1921 


3707 


5493 


784CIP2 136 


7261 


[ 136 


1922 


3708 


5494 


784CIP2 137 


7262 


137 


1923 


3709 


5495 


784CIP2 138 


7267 


136 


1924 


3710 


5496 


784CIP2 139 


7272 


139 


1925 


3711 


5497 


784CTP? 140 


7273 


14 0 


1926 


3712 


5498 


784 PTP2 141 

/ Oft <S 


7282 


141 


1927 


3713 


5499 


7R4f*TP? 147 


7288 


142 


1928 


3714 


5500 


784CIP2 143 


7291 


143 


1929 


3715 


5501 


784PTP3 144 


7293 


144 


1930 


3716 


5502 


7A4CIP? 145 


7294 


145 


1931 


3717 


5503 


784CIP2 146 


7299 


146 


1932 


3718 


5504 


784PTP? 147 


7300 


147 


1933 


3719 


5505 


784CIP? 14 8 


7312 


148 


1934 


3720 


5506 


7ftirTP5 14Q 


7313 


149 


1935 


3721 


5507 


7H4C"TP7 Tit) 


7315 


150 


1936 


3722 


55C8 


7fl4r*TP7 151 


7318 


151 


1937 


3723 


5509 


784CIP? 152 


7321 


152 


1938 


3724 


5510 


784CIP2 153 


7330 


153 


1939 


3725 


5511 


7B4CIP2 154 


7331 


154 


1940 


3726 


5512 


784CIP2 1 55 


7333 


15S 


1941 ! 


3727 


5513 


784CIP2 156 


7350 


156 


1942 


3728 


5514 


784CIP2 157 


7352 


157 


1943 


3729 


5515 


784CIP2 158 


7384 


158 


1944 


3730 


5516 


784CIP2 159 


7403 


159 


1945 


3731 


5517 


784CIP2 160 


7431 


160 


1946 


3732 


5518 


784CIP2 161 


7441 


16T 


1947 { 


3733 


5519 


784CIP2_162 


7453 


162 


1948 


3734 


5520 


784CIP2_163 


7467 


163 


1949 


3735 


5521 


784CIP2JL64 


7471 


164 


1950 


3736 


5522 


784CIP2JL65 


7493 


165 


19S1 


3737 


5523 


784CIP2 166 


7502 


166 


1952 


3738 


5524 


784CIP2167 


7511 


167 


1953 


3739 


5525 


784CIP2_168 


7514 


168 


1954 


3740 


5526 


784CIP2 169 


7520 


169 


1955 


3741 


5527 


784CIP2_170 


7541 


170 


1956 


3742 


5528 


784CIP2_171 


7570 


171 


1957 


3743 


5529 


784CIP2_172 


7578 


172 


1958 


3744 


5530 


784CIP2_173 


7583 


173 


1959 


3745 


5531 


784CIP2174 


7592 


174 


1960 


3746 


5532 


784CIP2_175 


7601 


175 


1961 


3747 


5533 


784CIP2_176 


7602 


176 


1962 


3748 


5534 


784CIP2 177 


7608 


177 


1963 


3749 


5535 


784CIP2_178 


7615 


178 


1964 


3750 


5536 


784CIP2_179 


7617 


179 


1965 


3751 


5537 


784CIP2_181 


7624 


180 


1966 


3752 


5538 


784CIP2182 


7626 


1B1 


1967 


3753 


5539 


784CIP2_183 


7640 


1B2 


1968 


3754 


5540 


7B4CIP2_1B4 


7641 


183 | 


1969 


3755 


5541 


784CIP2_1B5 


7641 
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COfl Tf\ un. 

oi-isj ±u Ci\J ; 

of full- 
length 
nucleotide 
sequence 


SEQ ID 
NO* of 
full- 
length 
peptide 
sequence 


OCrVl X D WU - 

nucleotide 
sequence 


eon TY\ 

NO: 

of contio: 

peptide 

sequence 


corrp Toond'trici 
SEQ ID NO: in 
priority 
application 


.SEQ ID 
NO: in 
U.S. S.N. 
09/488, 725 


184 


1970 


3756 


5542 


7B4CIP2JL86 


7641 


185 


1971 


3757 


5543 


784CIP2_187 


7642 


186 


1972 


375B 


5544 


784CIP2_188 


7649 


187 


1973 


3759 


5545 


784CIP2 189 


7656 


180 


1974 


3760 


5546 


784CIP2 190 


7657 


189 


1975 


3761 


5547 


784CIP2JL91 


j 7657 


| 190 


1976 


3762 


5548 


784CIP2_192 


7662 


191 


1977 


3763 


5549 


784CIP2_193 


766B 


192 


1978 


3764 


5550 


784CIP2_194 


7673 


193 


1979 


3765 


5551 


784CIP2195 


7690 


194 


1980 


3766 


5552 


784CIP2JL96 7700 


195 


1981 


3767 


5553 


784CIP2197 ; 7709 


196 


1982 


3768 


5554 


784CXP2 198 


7736 


197 


1983 


3769 


5555 


784CIP2_JL99 


7737 


198 


1984 


3770 


5556 


784CIP2_200 


7744 


199 


1985 


3771 


5557 


784CIP2__20l 


7771 


200 


19B6 


3772 


5558 


784CIP2_202 


7786 


201 


1987 


3773 


5559 


784CIP2__203 


7791 


202 


1988 


3774 


5560 


784CIP2__204 


7797 


203 


1989 


3775 


5561 


784CIP2__205 


7806 


204 


1990 


3776 


5562 


784CIP2_206 


7812 


205 


1991 


3777 


5563 


/B4CIPZ__ZGY 


7812 


206 


1992 


3778 


5564 


7o4CIJP2_2Qo 


7818 


207 


1993 


3779 


5565 




7822 


200 


1994 


3780 


5566 


/ o 4 CI Jr J- U 


TQ07 


209 




3 /Di 


ODU / 


/O^UrA A, XI. 


• 7830 


210 


1995 


3782 


5568 




7835 


211 


1997' ~ 


3783 


5569 


784CIP2 214 


7840 


! 212 


1998 


3784 


5570 


784CIP2 215 


7858 


213 


1999 


3785 


5571 


784CIP2 216 


7858 


214 


2000 


3706 


~ ■■ 5 5 72 


784CIP2 217 


7061 


215 


2001 


3787 


5573 


784CIP2_218 


7866 


216 


2002 


3788 


5574 


784CIP2J219 


9868 


217 


2003 


3789 


5575 


784CIP2220 


7896 


218 


2004 


3790 


5576 


784CIP2 221 


7898 


219 


2005 


3791 


5577 


784CIP2_222 


7900 


220 


2006 


3792 


5578 


784CIP2 223 


7906 


221 


2007 


3793 


5579 


784d*P2_224 


7908 


222 


2008 


3794 


5580 


784CIP2225 


7909 


223 


2009 


3795 


5581 


784CIP2_226 


7917 


224 


2010 


3796 


5582 


784CIP2J227 


7932 


225 


2011 


3797 


5583 


784CIP2 228 


7940 


226 


2012 


3798 


5584 


784CIP2_229 


7940 


227 


2013 


3799 


5585 


784CIP2_230 


7984 


228 


2014 


3800 


5586 


784CIP2231 


7984 


229 


2015 


3801 


5587 


784CIP2_232 


8001 


230 


2016 


3802 


5588 


784CIP2233 


8021 


231 


2017 


3803 


5589 


784CIP2_234 


8029 


232 


2018 


3804 


5590 


784CIP2 235 


8033 


i 233 


2019 


3805 


5591 


784CIP2236 


8040 


234 


2020 


3806 


5592 


784CIP2_23? 


8052 


235 


2021 


3807 


5593 


784CIP2 238 


8096 


236 


2022 


3808 


5594 


784CIP2_239 


8096 


237 


2023 


3809 


5595 


784CIP2240 


8113 


238 


2024 


3810 


5596 


784CIP2_241 


8126 


239 


2025 


3811 


5597 


784CIP2_242 


0132 


240 j 


2026 


3812 


5598 


784CIP2_243 


8137 


241 


2027 


3813 


5599 


784CIP2244 


8137 


242 j 


2028 


3814 


5600 


784CIP2 245 


8159 


243 


2G29 


3815 


5S01 


784CIP2246 


8159 | 


244 


2030 


3816 


5602 


784CIP2J247 


8161 


245 


2031 


3817 


5603 


784CIP2 248 


B176 
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SEQ ID NO: 


SEQ ID 


SBQ ID NO: 


SBQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


NO: 


docket number^ 


NO: in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 




pepcxae 




sequence 


priority 






sequence 






application 




246 


2032 


3818 


K 604 


/o4CIP2__249 


8196 


247 


2033 


jo Ay 


5605 


/H4UIP2 250 


8200 


248 


2034 






/84CIP2 251 


; 8212 


249 






5607 


fU4t_IP2 252 


j 8220 


250 






5608 


784CIP2 253 


8238 




/UJ / 


3823 


5609 


784CIP2 254 


| 8254 






3U24 


5610 


784CIP2 255 


| 8255 


253 


2039 


3825 


5611 


784CIP2 256 


8288 




2040 


3826 


5612 


784CIP2_257 


8296 


255 


2041 


3827 


5613 


784CIP2 258 


8329 


256 


2042 


3828 


5614 


784C1P2 259 


8362 


257 


2043 


3829 


5615 


784CIP2 260 


8429 


258 


2044 


3830 


5616 


784CIP2_261 


8436 ■ 


259 


2045 


3831 


5617 


784dP2_262 


8448 


260 


2046 


3832 


5618 


. 784CIP2J263 


8472 


261 


2047 


3833 


5619 


784CIP2264 


8502 


262 


2048 


3834 


5620 


784CIP2_265 


8504 


263 


2049 


3835 


5621 


784CIP2_266 


8507 


264 


2050 


3836 


5622 


784CIP2 268 


8509 


265 


2051 


3837 


5623 


784CIP2 269 


8515 


266 


2052 


3B38 


5624 


784CIP2_270 


8519 


267 


2053 


3839 


5625 


784CIP2_271 


8530 


268 


2054 


3840 


5626 


784CIP2J272 


8532 


269 


2055 


3841 


5627 


784CIP2 273 


8532 


270 


2056 


3842 


5628 


784CIP2 274 


8539 


271 


2057 


3843 


5629 


784CIP2_275 


8541 


272 


2058 


3844 


5630 


784CIP2_276 


8543 


273 


2059 


3845 


5631 


784CIP2 277 


8593 


274 


2060 


3846 


5632 


784CIP2 278 


8595 


275 


2061 


3847 


5633 


784CIP2_279 


8615 


276 


2062 


3848 


5634 


784CIP2 280 


8620 


277 


2063 


3849 


5635 


784CIP2__281 


8621 


278 


2064 


3850 


5636 


784CIP2 282 


8623 


279 


2065 


3851 


5637 


784CIP2283 


8625 


280 


2066 


3852 


5638 


7B4CIP2_284 


8628 


281 


2067 


3853 


5639 


7B4CIP2 285 


8628 


282 


2068 


3854 


5640 


784CIP2286 


8629 


283 


2069 


3855 


5641 


784CIP2_287 


8630 


284 


2070 


3856 


5642 


784CIP2_288 


8631 


285 


2071 


3857 


5643 


784CIP2_289 


8633 


286 


2072 


3858 


5644 


784CIP2_J290 


8634 




2073 


3 859 


5645 


784CIP2_291 


863S 




2074 


3860 


5646 


784CIP2 292 


8636 


289 




3861 


5647 


784CIP2 293 


8659 


290 


*u IX> 


3862 


5648 


7B4CIP2 294 


8660 


291 


2077 


3863 


5649 


7B4CIP2 295 


8667 


292 


2078 


3864 


5650 


*J96 


8667 


293 


2079 


3865 


DoDi 1 


/o4Cl P2__Z9 / 


8685 


294 


2080 


O tJO"> 




/a«*L.IP2_298 


8805 


295 


2081 


3867 


5653 


/o4a_IP2_299 


8896 


296 


2082 


3868 




/04C:JLPZ 3U0 


8978 


297 


2083 




30 33 


784CIP2 301 


9046 


298 


2084 


3870 


5656 


784CIP2 302 


9048 


299 


2085 


3871 


5657 


784CIP2 303 


9116 


300 


2086 


3872 


5658 


784CIP2304 


9195 ! 


301 


2087 


3873 


5659 


7B4CIP2_305 


9201 


302 


2088 


3874 


5660 


784CIP2 306 


9307 


303 


2089 


3875 


5661 


784CIP2 307 


9321 


304 


2090 


3876 


5662 


784CIP2308 


9397 


305 


2091 


3877 


5663 


784CIP2 309 


9405 


306 


2092 


3878 


5664 


784CIP2 310 


9406 


307 


2093 


3879 


5665 


784C1P2 311 


9422 
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SEQ ID NO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of conticr 


NO: 


docket number^ 


NO: in 


length 


full- 


nucleotide 


of con tig 


corresponding 


U.S. S.N- 


nucleotide 


length* 


sequence 


pept ide 


oEy JLU WO: in 


09/488,725 


sequence 


peptide 

0£*fTt tan ***** 




sequence 


priori ty 

cipjjx x «-di_ J-CI1 




308 




3980 


" 5666 


7fl4f*TP9 11 O 


DA Qlt 


JUS 


2095 


JOOl 


5667 




QC1 O 


Offl " " 

J xu 




3882 


5668 


/OlV-XJt*^ 3X4 


yo3<s 


J J. J. 


OflQ"7 




5669 


TftAPTTJO H t 
/ 09l-.XJr.fi -JXD 


" qcZi 

jbfiX 






3884 


5670 


■7H4PTT30 "11 C 
» o ** ^-Xir<i J XO 


?dd4 


313 


2099 


-inoe 


DO IX 


/ o4Lll'Z 31/ 


9691 


314 


2100 


JBOb 


~ eel-) 

5b /<£ 


• *1ftvir*Tm 11 o 
/S4l,i^ 3Xo 


3 /OO 


315 


2101 


3887 


5o73 


/84t-lt»^ 319 


9716 


316 


2102 


3888 


5674 


/ O 4 L.I F2__3 2 0 


9721 


317 


2103 


38B9 


5675 


784CIP2 321 


9870 


318 


2104 


3890 


5676 


784CIP2_322 


9887 


319 


2105 


3891 


5677 


784CXP2_323 


9923 


320 


2106 


3892. 


5678 


784CIP2__324 


9938 


321 


2107 


3893 


5679 


784ClP2_325 


9964 


322 


2108 


3894 


5680 


784CIP2_326 


10007 


323 


2109 


3B95 


5681 


784CIP2_327 


10009 


324 


2110 


3896 


5682 


784CIP2_328 


10046 


325 


2111 


3897 


5683 


784CIP2_329 


10156 


326 


2112 


3898 


5684 


784CIP2_330 


10276 


327 


2113 


3899 


5685 


784CIP2_331 


10283 


328 


2114 


3900 


5686 


784CIP2B1 


152 


329 


2115 


3901 


5687 


784CIP2B_2 


167 


330 


| 2116 


3902 


5688 


784CIP2B_3 


205 


331 


2117 


3903 


5689 


784CIP2B __4 


210 


332 


2118 


3904 


5690 


| 784CIP2B 5 


225 


333 


2119 


390S 


5691 


784CIP2B_6 


226 


334 


2120 


3906 


5692 


784CIP2B_7 


264 


335 


2121 


3907 


5693 


784CIP2B__8 


268 


336 


2122 


3908 


5694 


784CIP2B_9 


293 


337 


2123 


3909 


5695 


784CIP2B_10 


293 


338 


2124 


3910 


5696 


784CIP2B_11 


293 


339 


2125 


3911 


5697 


784CIP2B_12 


302 


340 


2126 


3912 


5698 


784CIP2B 13 


311 


341 


2127 


3913 


5699 


784CIP2B 14 


352 


342 


2128 


3914 


5700 


784CIP2B_15 


358 


343 


2129 


3915 


5701 


784CIP2B_16 


368 


344 


2130 


3916 


5702 


784CIP2BJ17 


393 


345 


2131 


3917 


5703 


784CIP2B_18 


477 


346 


2132 


3918 


5704 


784CIP2B_19 


508 


347 


on « 

2133 


3919 


5705 


70/trrp'?D on ~ 

to 4 CI PZB__Z a 


508 


348 


2134 


3920 


5706 


fti i t\~xVZo 4.1 


515 


349 


2135 


3921 


5707 


784CIP2B__22 


578 


350 


2136 


1 ooo 


C7AD 


ft)4Li FfiO Z3 


588 


351 


2137 


3:f £3 


5709 


TUilPTDID O/l 
/ o «* v,X Ir Zd_^{ 9 


591 


*D CO 

3D* 


O 1 1 ft 


1QOA 


D 111) 


ia a/"*tt>oi* oc 
/OflV Lir^tJ ^c!> 


D3J 




oi i a 


3925 


O / XX 


/D*WX Jf<<St» <SO 






2140 


3926 


5712 


7nd.r , TT>on 0*7 


619" " 


355 


2141 


3927 


5713 




620 


IKK 


2142 


3928 


5714 


/ o4v_Xr/n ^ J 


■ - 


357 


2143 


3929 


5715 


/O^iv*XJrfi0 JU 


692 


358 


2144 


3930 


5716 




753 


I^Q 

jj? 


2145 


3931 


5717 


"7ftdr*Tt>on *»o 


758 


360 


2146 


3932 


5718 


784CIP2B 33 


787 


361 


2147 


3933 


5719 


784CIP2B34 


833 


362 


2148 


3934 


5720 


7B4CIP2B35 


838 


363 


2149 


3935 


5721 


784CIP2B_36 


870 


364 


2150 


3936 


5722 


7B4CIP2B_37 


891 


365 


2151 


3937 


5723 


7B4CIP2B 38 


891 


366 


2152 


3938 


5724 


784CIP2B39 


921 


367 


2153 


3939 


5725 


784CIP2B_40 


924 


368 


2154 


3940 


5726 


784CIP2Bj41 


932 


369 


2155 


3941 


5727 


784CIP2B 42 


942 
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cRn m fun • 


SSQ ID 


pen ttv mo* 


SBQ ID 


Priority 


SBQ ID 


of full- 


JiU : or 




NO ~ 


docket number_ 


ISO: in 


length 


full- 


micl prtt*iHp 

UUViXCiV/UXUC 




c or r e sp ond i nj 




nucleotide 


lenath 


sequence 


peptide 


SEO TD WO ■ -in 


QQ/Afift 70C 
u J f ** a o # / £3 


sequence 


peptide 




sequence 


priority 






sequence 






appl ica t ion 




| 370 


2156 


3942 


5728 


784CIP2B 43 


958 


371 


2157 


3943 


5729 


784CIP2B 44 


968 


372 


2158 


3944 


5730 


784CIP2B 45 


992 


373 


2159 


3945" 


5731 


784CIP2B 46 


1025 


374 


2160 


3946 


5732 


784CIP2B 47 


1074 


375 


2161 


3947 


5733 


784CIP2B 48 


1104 


376 


2162 


3948 


5734 


784CIP2B 49 


1114 


377 


2163 


3949 


5735 


784CIP2B 50 


1144 


378 


2164 


3950 


5736 


784CIP2B 51 


1262 


379 


2165 


39S1 


5737 


784CIP2B 52 


in r 


380 


2166 


3952 


5738 






381 


2167 


3953 


5739 


*784f*TP3n tiA 




382 


2168 


3954 


5740 






383 


2169 


3955 


5741 


/ OH V^XIr^lo OO 


1464 


384 


2170 


3956 


i KfA o 


» D«± l-.XJT4S.D_3 / 


1584 


38S 


217X 


3957 


CJ7A "i 




1617 


386 


2172 


■> 2*90 


HI A A 


"7 O >t /~*T DID CO 


1724 


387 


2173 


3959 




/a«±\_.X r^de DU 


1728 


388 


2174 








1772 


389 




3961 


5747 


/84C1P2H 62 


1809 


390 


2176 




C*7A Q 




1868 




2177 


3963 


5749 


7 8 4 CI P2B__6 4 


1898 


392 


2178 




575Q 


/o4Llr^i} DO 


1926 


393 


2179 


3965 


5751 


/o4L-j.t"^hJ bb 


1965 




4 XoU 


-TqP7 


-> '->«£ 


/B4CIP2B__b / 


1967 


39S 




J 5*o / 


5753 


7a4CIP2B_©B 


1995 






Oca 


5754 


/ B 4 CI P2B_6 9 


2005 . 


■JOT 

■>2I / 


Ol fit 




5755 


/B4CIP2B /0 


2027 


398 


ZXDf* 


39*70 


» /bo 


/04C1P2B /I 


2055 


399 




1(171 




/a'll.X.F^iS I & 


2103 


* 400 


2186 


3972 


5758 


*7nAr'Tt>*>ia *t*» 


2106 


401 


2187 


j j t j 


5759 


/O's'UXi'^sO 


2166 


402 


2188 




S760 


7B/irTDOn ftr 

/0«Sl-X.fc* 4SU /*> 


2175 


403 


2189 


3 975 


qrei 

3 /OX 


7DAPTDOD *7£ 


2176 


4 04 


2190 


3 976 


5762 


/09» — LfZt*__ /O 


2236 


405 


2191 


3977 


5763 


7fl4PTDOn *7Q 


2250 


406 


2192 


3978 


5764 


7RAHTD7B (111 
/ D'iL.J.r/Q DU 


23' 06 


407 


2193 


3979 


- 5765 


*7ftAl r *TI>''>n (17 
/04'L.X.ir4:Ja OA 


07*)1 


408 


2194 


3980 


S7S6 




2340 


409 


2195 


3981 


5767 


784CIP2B 83 


2371 


410 


2196 


3982 


5768 


784CIP2B 84 


2399 


411 


2197 


39B3 


5769 


784CIP2B 85 


2411 


412 


2198 


3984 


5770 


784CIP2B 86 


2428 


413 


2199 


3985 


5771 


784CIP2B 87 


2430 


414 


2200 


3986 


5772 


764CIP2B 88 


2439 


415 


2201 


3987 


5773 


784CIP2B 89 


2447 


416 


2202 


3988 


5774 


784CIP2B 90 


2461 


417 


2203 


3989 


5775 


784CIP2B 91 


2487 


418 


2204 


3990 


5776 


784CIP2B 92 


2492 


419 


2205 


3991 


5777 


784CIP2B 93 


2512 


| 420 


2206 


3992 


5778 


784CIP2B 94 


2564 


42t 


2207 


3993 


5779 


784CIP2B 95 


2678 


422 


2208 


3994 


5780 


784CIP2B_96 


2816 


423 


2209 


3995 


5781 


784CIP2B97 


2818 


424 


2210 


3996 


5782 


784CIP2B98 


2819 


425 


2211 


3997 


5783 


7B4CIP2B_99 


2943 


42$ 


2212 


3998 


5784 


7B4CIP2B_100 


3137 


427 


2213 1 


3999 


5785 


784CIP2B_101 


3137 


428 


2214 


4000 


5786 


784CIP2B_102 


3160 


429 


2215 


4001 


5787 


784CIP2B103 


3323 


430 


2216 


4002 


5788 


784CIP2B_104 


3360 


431 


2217 ! 


4003 


5789 


784CIP2B 105 


3362 
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QPO ID NO • 






SBQ ID 


rli on L.jr 


■JR/ -LJJ 


of full- 






NO: 


UlAJVCL UUHIUCI 


NO: in 


length 


full- 


nucleotide 


of con tig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SKQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




432 


1 2218 


4004 


5790 


784CIP2B 106 


3417 


433 


2219 


4005 


5791 


784CIP2B_107 


3418 


434 


2220 


4006 


5792 


7B4CIP2B_108 


3442 


435 


2221 


4007 


5793 


784CIP2B_109 


3442 [ 


436 


2222 


4008 


5794 


784CIP2B_110 


3444 


437 


2223 


4009 


5795 


7B4CIP2B111 


3855 


438 


2224 


4010 


5796 


784CIP2BJ112 


3863 


439 


2225 


4011 


5797 


784CIP2B113 


4090 


440 


2226 


4012 


5798 


784CIP2B_114 


4105 


441 


2227 - 


4013 


5799 


784CIP2B__115 


4142 


442 


2228 


4014 


5800 


784CIP2B 116 


4142 


443 


2229 


4015 


5801 


784CIP2B 117 


4149 


444 


2230 


4016 


5802 


784CIP2B 118 


4196 


445 


2231 


4017 


5803 


784CIP2B 119 


4202 


446 


2232 


4018 


5804 


784CIP2B 120 


4274 


447 


2233 


4019 


5805 


784CIP2B 121 


4304 


448 


2234 


4020 


5806 


784CIP2B 122 


4306 


449 


2235 


4021 


! 5807 


784CIP2B 123 


4311 


450 


2236 


4022 


5808 


784CIP2B 124 


4321 


451 


2237 


4023 


5809 


784CIP2B 125 


4323 


452 


2238 


4024 


5810 


784CIP2B 126 


4332 


453 


2239 


4025 


5811 


7B4CIP2B 127 


4488 


454 


2240 


4026 


5812 


7B4CIP2B 128 


4588 


455 


2241 


4027 


5813 


784CIP2B 129 


5569 


456 


2242 


4028 


5814 


784C1P2B 130 


55*73 


457 


2243 


4029 


5815 


784CIP2R 131 


5577 


458 


2244 


4030 


5816 


784CIP2B 132 


5579 


453 


2245 


■ 403 i 


5817 


784CIP2B 133 


5582 " 


460 


2246 


4032 


5818 


784CIP2B 134 


5583 


461 


2247 


4033 


5819 


7B4CIP2B 135 


5584" 


462 


2248 


4034 


5820 


784CIP2B" 136 


5585 


463 


2249 


4035 


5821 


784CIP2B 137 


5591 


464 


2250 


4036 


5822 


784CIP2B 138 


5593 


465 


2251 


4037 


5823 


784CIP2B 139 


5594 


466 


2252 


4038 


5824 


784CIP2B 140 


5594 


467 


2253 


4039 


5825 


784CIP2B 141 


5598 


468 


2254 


4040 


5826 


784CIP2B 142 


5602 


469 


2255 


4041 


5827 


784CIP2B143 


5605 


470 


2256 


4042 


5828 


784CIP2B_144 


5608 


471 


2257 


4043 


5829 


784CIP2B 145 


5617 


472 


2258 


4044 


5830 


784CIP2B_146 


5620 


473 


2259 


4045 


5831 


784CIP2B 147 


5622 


474 


2260 


4046 


5832 


784CIP2B148 


5623 


475 


2261 


4047 


5833 


7B4CIP2BJL49 


5624 


476 


2262 


4048 


5834 


784CIP2B_150 


5625 


477 


2263 


4049 


5835 


784C1P2B_151 


5627 


478 


2264 


4050 


5836 


784CIP2B_152 


5628 


479 


• 2265 


4QS1 


5837 


784CIP2B153 


5*36 


480 


2266 


4052 


5838 


7B4CIP2B_154 


5632 


481 


2267 


4053 


5839 


784CIP2B 155 


5640 


482 


2268 


4054 


5840 


7B4CIP2B_156 


5641 


483 


2269 


4055 


5841 


7B4CMP2B157 


5643 


484 


2270 


4056 


5B42 


784CIP2B_158 


5647 


485 


2271 


4057 


5843 


784CIP2B_159 


5649 


486 


2272 


4058 


5844 


784CIP2B_160 


5658 


487 


2273 


4059 


5845 


784CIP2B 161 


5659 


488 


2274 


4060 


5846 


784CIP2B 162 


5667 


489 


2275 


4061 


5B47 


784CIP2B_163 


5672 


490 


2276 


4062 


5848 


7B4CIP2B_164 


5674 


_491 


2277 


4063 


5849 


784CIP2B 165 


5678 


492 


2278 


4064 


5BS0 


784CIP2B166 


5680 


493 


2279 


4065 


5851 


784CIP2BJL67 


5684 
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SBQ ID 


fa fin Tn VTY"\ 

h>E\j XD NO : 


SBQ ID 


Priority 


SEQ ID 


of full- 




of coiifcxg^ 


NO * 


docket number__ 


NO: in. 


Xengtti 


full- 


nuclpot* 1 rfi* 


of font - "la 






nucleotide 


length 


sequence 


peptide 


SBQ ID NO- in 


nq/ADQ 7"?c 

U7/4DO, '*J 


sequence 


peptide 




sequence 


priority 






sequence 






application 




494 


2280 


4066 


5852 


784CIP2B 168 


5686 


495 


2281 


4067 


5853 


784CIP2B_169 


5694 


496 


2282 


4068 


5854 


784CIP2B 170 


5698 


497 


2283 


4069 


5855 


7B4CIP2B 171 


5699 


498 


2284 


4070 


5856 


784CIP2B 172 


5712 


499 


2285 


4071 


5857 


784CIP2B 173 


5719 


500 


2286 


4072 


5858 


784CIP2B 174 


5720 


501 


2287 


4073 


5859 


784CIP2B 175 


5727 


502 


2288 


4074 


5860 


784CIP2B 176 


5730 


503 


2289 


4075 


5861 


784CIP2B 177 


5734 


504 


2290 


4076 


5862 


7ft4r"TP1>n 7 7R 




505 


2291 


4077 


5863 


' 7B4PTP9R 179 


5739 


506 


2292 


4078 


5864 


7B4PTmn i on 




507 


2293 


4079 


5865 


7fl4r , TDt?R 1 ft! 


C74 A 


SOB 


2294 


4080 


OOO D 


7fldPTD7R 1 P.O 


5748 


509 


2295 


4081 


5867 




5749 


510 


2296 


4082 


DODO 


/04V~XirZl? 1U4 


5750 


511 


2297 


4083 


3Cby 


/ o 4 C J. i*ZD_±o b 


5750 


512 


2298 


Ana a 


5870 


IQAPT'OOn 1QC 


5750 


513 


2299 


4085 


5871 


/04(_J.F2i> lb/ 


5761 


514 


2300 




cot> 
±>a /<£ 


/ lob 


5762 


515 




4087 


5873 


fH4CxPZB 189 


5767 


516 


2302 


4088 


5874 




5773 


517 




4089 


5875 




5^ 7 83 


518 ~ 


2304 


4090 


5876 




5784 


519 






5877 




5788 


— 520 


AJUD 


At\ao 


5878 


/H4CIP2B 194 


5798 


521 


2307 ! 




SO iy 




5807 


522 


2308 


4094 


coart 

DODW 




5818 


523 


2309 




COD1 

Do a ± 




5819 


"524 


2310 


4096 






5827 


525 


2311 


4097 


5883 




coo a 


526 


2312 


4098 


5884 




RHA7 


-527 


2313 


4099 


5885 




5853 


528 


2314 


4 100 


5886 




JODl 


529 


2315 


4101 


5887 




JOD4 


530 


2316 




caap 

DOOD 


7flflPTD9R one 




531 


2317 


4103 


5889 




5871 


532 


2318 


4104 


5890 


784PTP9B 907 




533 


2319 


4105 


5891 




5873 


534 


2320 


4106 


5892 


784CIP2B 209 


5875 


535 


2321 


4107 


5893 


7B4CIP2B 210 


5878 


536 


2322 


4108 


5894 


784CIP2B 211 


5879 


537 


2323 


4109 


5895 


784CIP2B 212 


5880 


53B 


2324 


4110 


5896 J 


784CIP2B 213 


5880 


539 


2325 


4111 


5897 


784CIP2B_214 


5880 


540 


2326 


4112 


5898 


784CIP2B 215 


5880 


541 


2327 


4113 


5899 


784dP2B_216 


5885 


542 


2328 


4114 


5900 


784CIP2B 217 


5895 


543 


2329 


4115 


5901 


784CIP2B 218 


5898 


544 


2330 


4116 


5902 


784CIP2B 219 


5902 


545 


2331 


4117 


5903 


784CIP2B 220 


5904 


546 


2332 


4118 


5904 


784CIP2B_221 


5918 


547 


2333 


4119 


5905 


7B4CIP2B_222 


5921 


548 


2334 


4120 


5906 


784CIP2B 223 


5927 


549 


2335 


4121 


5907 


7B4CIP2B_224 


5932 


550 


2336 


4122 


5908 


784CIP2B 225 


5939 


551 


2337 


4123 


5909 


784CIP2B 226 


5945 


552 


2338 


4124 


5910 


784CIP2B 227 


5946 


5^53 


2339 


4125 


5911 


784CIP2B228 


5947 


554 


2340 


4126 


5912 


784CIP2B 229 


5956 


555 


2341 


4127 


5913 


784CIP2B 230 


5967 
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c tnt t r> ton - 


SEQ ID 


c" nw xn urn * 








of full- 


«<>/: 01 


on con tig 


NO: 




NO : in 




full- 




of eont id 


crorTeBtiririfii tict 

WV-k -L. t*PyUIH 1 1 II J 


O S R N 


nucleotide 


length 


sequence 


peptide 


SBQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




556 


2342 


4128 


5914 


7B4CIP2B 232 


5975 


557 


2343 


4129 


5915 


7B4CIP2B233 


5977 


558 


2344 


4130 


5916 


784CIP2B234 


5978 


559 


2345 


4131 


5917 


784CIP2B_235 


5979 


560 


2346 


4132 


5918 


784CIP2B 236 


5980 


561 


2347 


4133 


5919 


784CIP2B 237 


5988 


562 


2348 


4134 


5920 


784CIP2B__238 


5989 


563 


2349 


4135 


5521 


784CIP2B__239 


5991 


564 


2350 


4136 


5922 


784CIP2B 240 


5997 


565 


2351 


4137 


5923 


784CIF2B 241 


5998 


566 


2352 


f 4138 


5924 


784PIP2B 242 


6003 


567 


2353 


4139 


5925 


784CIP2B 243 


6004 


566 


2354 


4140 


5926 


784CIP2B 244 


6013 


569 


2355 


4141 


5927 


784CIP2B 245 


6028 


570 


2356 


4142 


5928 


7ft4C*TP3ft S>4C 

/ D ^t\^J,Jtr ZD ^ 1 D 


6028 


571 


2357 


4143 


5929 


7fl4("'TP2B 247 


6029 


572 


2358 


4144 


5930 


TRAfTP^B 'jAr 


6031 


573 


2359 


4145 


5931 


7ft4nTD9R 


6031 




23"6"0~ 


4146 




7R4r , TDon 


6*032 


575 


£• J t>_L 


* JLfll / 




*7nAr*mon 011 
/n*vJUr J. 




576 




4148 


5934 






577 


2363 


A 1 il Q 






6043 


3 fa 


o-xAa 








dflA A 


!> /If 










1 go46 






11 CO 


cam 




6048 


581 


>i.>0 / 


Al CI 




IftAPTDDII TCI 

/ 0 l ±\~Xk?^D Z3 / 




COT 
SOX 


-£ JO O 


* A 94 


' 5940 


T0Ar»TPon oca 


tfoSi 


bb^ 


2369 


4155 


5941 


# alV Li ilD 137 


6053 




2370 


4156 


5942 


7A4f»TP9B ^fiO 


6060 


585 


2371 


4157 


5943 


7B4ptp:?b 261 


6063 


586 


2372 


4158 


5944 


784C1P2B 262 


6066 


587 


2373 


4159 


5945 


TJMPTP^B ?61 


6067 


588 


2374 


4160 




7A4PTP9B ?64 


6068 


589 


2375 


4161 


5947 


7R4r*TPPB 


6073 


590 


23 76 


4162 


5948 


/OiiwXrAjo ADD 


6076 


591 


2377 


4163 


5949 


7R4CIP2B 267 


6076 


592 


2378 


4164 


5950 


784CIP2B 268 


6077 


593 


2379 


4165 


5951 


784CIP2B 269 


6079 


594 


2380 


4166 


5952 


7B4CIP2B 270 


6082 


595 


2381 


4167 


5953 


784C1P2B 272 


6088 


59$ 


2382 


4168 


5954 


784CIP2B 273 


6091 


597 


2383 


4169 


5955 


784CIP2B_274 


6094 


598 


2384 


4170 


5956 


784CIP2B_275 


6101 


599 


2385 


4171 


5957 


784CIP2B276 


6103 


600 


2386 


4172 


5958 


784CIP2B_277 


6104 


601 


2387 


4173 


5959 


784CIP2B_278 


6108 


602 


2388 


4174 


5960 


784CIP2B_279 


6112 


603 


2389 


4175 


5961 


784CIP2B_280 


6121 


"604 


2390 


4176 


5962 


784CIP2B281 


6125 


605 


2391 


4177 


5963 


784CIP2B_282 


6126 


606 


2392 


4178 


5964 


784CIP2B_J283 


6128 


607 


2393 


4179 


5965 


784CIP2B_284 


6129 


608 


2394 


4180 


5966 


784CIP2B285 


6133 


609 


2395 


4181 


5967 


784CIP2B_286 


6133 


610 


2396 


4182 


5968 


784CIP2B_287 


6135 


611 


2397 


4183 


5969 


784CIP2B_288 


6139 


612 


2398 


4184 


5970 


7B4CIP2B 289 


6141 


613 


2399 


4185 


5971 


784CIP2B_290 


6145 


614 


2400 


4186 


5972 


784CIP2B 291 


6146 


615 


2401 


4187 


5973 


784CIP2B 292 


6148 


616 


2402 


4188 


5974 


784CIP2B293 


6149 


6l"> " 


2403 . 


4189 


5975 


784CIP2B 294 


6149 
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WO 01/53312 



PCT/US00/34263 



J.u wu . 




i>aS2 Xu aO: 


SEQ U> 


Priority 


SEQ ID 


of full- 


NO - nf 
H« - OX. 


oi cuntly 




docket number 


NO; in 


length 


full- 


nucleotide 


of con t icj 




u s s w 


nucleotide 


length 


sequence 


neotide 


SEQ ID NO; in 


QQ/Afin TOR 


sequence 


peptide 




sequence 


priori ty 






sequence 






application 




618 


2404 


4190 


5976 


7B4CIP2B 295 


6153 


619 


2405 


4191 


5977 


784CIP2B 296 


6159 


620 


2406 


4192 


5978 


784CIP2B 297 


6164 


621 


2407 


4193 


5979 


784CIP2B 298 


6167 


622 


2408 


4194 


5980 


784CIP2B 299 


6172 


623 


2409 


4195 


5981 


784CIP2B 300 


6173 


624 


2410 


4196 


5982 


784CIP2B 301 


6190 


! 625 


2411 


4197 


5983 


784CIP2B 302 


6194 


626 


2412 


4198 


5984 


784CIP2B 303 


6196 


627 


2413 


4199 


5985 


784rTP?ft 104 


6197 


628 


2414 


4200 


5986 


7B4fIP2R 305 


6198 


629 


2415 


4201 


5987 


7fl4C , TP?R 206 


6198 


630 


2416 


4202 


598 8 


7R4f , TP2ft "inn 


o*ii 


631 


2417 


4203 


5989 


! TBAf*riJO'n ihq 


D£l3 i 


632 


2418 


4204 


5990 




o /iy 


633" 


2419 


4205 


CQQ1 


/oltlriiD ill 


DjSAo 


634 


2420 


4206 




/ U IV- Air AO Jl£ 




635 


242" 


i au i 


5993 


TO it rTT mH 1 1 "J 

/ o 4 ulFziJ^3 1 3 


6234 


636 


A*±£Z 


4208 




/ o4 C_XFzo_J 14 


6237 


637 


2423 


4209 


5995 


/olUXirAJi Jib 


6238 


638 


On OA 


42l0 


5996 


/o^Clir^ia Jib 


6239 


639 




4211 


5997 


784CIP2B__317 


6239 


640 


2426 


4212 


5998 


TRAfTDOtt tlft 
t OtV^AtrA-P JlD 




641 


2427 


4213 


5999 


'IQAr^TDOTl no 
^MILli'/y -5Ap 


OA n 


642 


2428 


All A 






ell* 
b244 


643 




AOl C 
"3 AJ.r> 


6001 


7o4Lli'AB 321 


6245 


i 644 




4216 


6092 




6250 


645 1 


A*4J-L 


1Z± / 


6003 


/ B 4 Cl P2JB_3 2 3 


6252 


646 


2432 






*7 O PTDOD OOA 


COO 
b Ai>^ 


647 


2433 


aoi q 




TQ/1 f^TDOD IOC 
/ O * L.J. ** A±J J Ai> 


OCC 

pazjo 


648 


2434" 


4220 


6006 


' O^vXtrAJO j£D 


oc n " " 


649 


2435 


4221 


Ol7U / 


fO^Llr^O -*A / 


6261 


650 


2436 


4222 




7RA^¥po.p too 




651 


2437 


4223 


6009 






652 


2438 


4224 


6010 


f O'tL.XtrAIl jjU 


6266 


653 


2439 


4225 


6011 


7A4f , TP>R m 
» o i V, J. Ir JJl 


6270 


654 


2440 


4226 


6012 




5^71 


■^5"~ 


2441 


4227 


6013 




6274 


656 


2442 


4228 


6014 


7B4CIP2H 115 


6276 


657 


2443 


4229 


6015 


784CIP2B 33S 


6281 


658 


2444 


4230 


6016 


784CIP2B 337 


6281 


659 


2445 


4231 


6017 


784CIP2B 338 


6288 


660 


2446 


4232 


6018 


784CIP2B 339 


6292 


661 


2447 


4233 


6019 


784CIP2B 340 


6294 


662 


2448 


4234 


6020 


784CIP2B 343 


6312 


663 


2449 


4235 


6021 


784CIP2B 344 


6312 


664 


2450 


4236 


6022 


784CIP2B 345 


6312 


665 


2451 


4237 


6023 


784CIP2B346 


6322 


666 


2452 


4238 


6024 


784CIP2B 347 


6324 


667 


2453 


4239 


6025 


784CIP2B 349 


6329 


668 


2454 


4240 


6026 


7B4CIP2B 350 


6331 


669 


24S5 


4241 


6027 


784CIP2B 351 


6333 


670 


2456 


4242 


6028 


784CIP2B 352 


6334 


671 


2457 


4243 


6029 


784CIP2B353 


6337 


672 


2458 


4244 


6030 


784CIP2B 354 


6339 


673 


2459 


4245 


6031 


784CIP2B_355 


6346 


674 


2460 


4246 


6032 


784CIP2B_356 


6348 


675 


2461 


4247 


6033 


784CIP2B 357 


6348 


676 


2462 


4248 


6034 


784CIP2B_358 


6350 


677 


2463 


4249 


6035 


784CIP2B 359 


6351 


678 


2464 


4250 


6036 


784CIP2B 360 


6355 


679 


2465 


4251 


6037 


7B4CIP2B_36l \ 


6362 
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SEQ ID NO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


NO: 


docket nuniber_ 


NO: in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






appl i ca t ion 




bBU 


->ac:c 






/84CIP2B 362 


6366 


bol 


2467 


I 4253 


6039 


784CIP2B_363 


6369 


682 


2468 


4254 


6040 


784CIP2B 364 


6371 


boi 




>To"cc 

4<£bb 


f>U41 


784CIP2B 365 


6376 


684 


2470 


4256 


6042 


784C1P2B_366 


6379 


685 


2471 


4257 


6043 


784CIP2B 367 


6380 


686 


2472 


425B 


6044 


784CIP2B 368 


6381 


687 


2473 


4259 


6045 


784CIP2B_369 


6392 


688 


2474 


4260 


6046 


784CIP2B 370 


6395 


689 


2475 


4261 


6047 


784CIP2B 371 


6397 


690 


2476 


4262 


6048 


784CIP2B_J372 


6400 


691 


2477 


4263 


6049 


784CIP2B 373 


6401 


692 


2478 


4264 


6050 


7Q4CIP2B374 


6411 


693 


2479 


4265 


6051 


7B4CIP2B_375 


6411 


694 


2480 


4266 


| 6052 


7B4CIP2B_376 


6411 


695 


2481 


4267 


6053 


784CIP2B_377 


6416 


696 


I 2482 


4268 


6054 


784CIP2B_37B 


641B 


697 


24B3 


4269 


6055 


784CIP2B_379 


6422 


698 


24B4 


4270 


6056- 


784CIP2B__3B0 


6423 


699 


2485 


4271 


6057 


784CIP2B_381 


6426 


700 


2486 


4272 


6058 


784CIP2B_3B2 


6427 


701 


2487 


4273 


6059 


784CIP2B 383 


6428 


702 


2438 


4274 


6060 


784CIP2B_384 


6429 


703 


24 B9 


4275 


6061 


784CIP2B_385 


6430 


704 


2490 


4276 


6062 


784CIP2B_386 


6432 


705 


2491 


4277 


6063 


784CIP2B_387 


6432 


706 


2492 


4278 


6064 


784CIP2B_388 


6438 


707 


2493 


4279 


6065 


7B4CIP2B_389 


6441 


708 


2494 


4280 


6066 


7B4CIP2B_390 


6446 


709 


2495 


4281 


6067 


784CIP2B 391 


6454 


710 


2496 


4282 


6068 


784CIP2B_392 


6459 


711 


2497 


4283 


6069 


784CIP2B 394 


64 61 


712 


2498 


4284 


6070 


784CIP2B_395 


6467 


713 


2499 i 


4285 | 


6071 


784CIP2B_396 


6468 


714 


r~ 2500 


4286 


6072 


7B4CIP2B_397 


6487 


715 


2501 


4287 


6073 


784CIP2B_398 


6491 


716 


2502 


4288 


6074 


7B4CIP2B3 99 


6506 


717 


2503 


4289 


6075 


784CIP2B_401 


6514 


718 


2504 


4290 


6076 


784CIP2B402 


6519 


719 


2505 


4291 


6077 


784CIP2B_403 


6521 


720 


2506 


4292 


6078 


784CIP2B_404 


6532 


721 


2507 


4293 


6079 


784CIP2B^405 


6536 


722 


2508 


4294 


6080 


784CIP2B_406 


6543 


723 


2509 


4295 


6081 


784CIP2B_407 


6544 


724 


2510 


4296 


6082 


7B4CIP2B^408 


6548 


725 


2511 


4297 


6083 


784CIP2B_409 


6551 


726 


2512 


4298 


6084 


784 CIP2B_4 10 


6551 


"ion 




4299 


6085 


no jiriTn^B j t i 

7 B 4<J±P 2B 4 11 


6552 


ion 




4300 


6086 


784CIP2B_412 


6554 


729 


2515 


4301 


6087 


4J.3 


7c cc 


730 




4302 


6088 


7B4CIP2B 414 


6560 


in 

f-ix 


2517 


4303 


6089 


78 4CIP2B_4 15 


656*3 


732 


2518 


4304 


6090 


7A4PT09R die 


6564 


733 


2519 


4305 


6091 


784CIP2B417 


6567 


734 


2520 


4306 


6092 


784CIE»2B_418 


6573 


735 


2521 


4307 


6093 


784CIP2B_419 


6575 


736 


2522 


4308 


6094 


784CIP2B__420 


6577 


737 


2523 


4309 


6095 


784CIP2B_421 


6593 | 


738 


2524 


4310 


6096 | 


784CIP2B_422 


6595 


739 


2525 


4311 


6097 


784CIP2B 423 


6599 


740 


2526 


4312 


6098 


784CIP2B__424 


6625 


741 


2527 


4313 


6099 


784CIP2B 425 


£625 
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SBQ ID NO: 


SBQ ID 


SEQ ID NO; 


SEQ ID 


Priority 


SEQ ID 


Ot tUxA- 


HO: of 


of contig 


NO: 


docket number_ 


NO: in 


length 


IU11- 


nucleobide 


oa. tuncig 


corresponding 


U - o - 5 .£* * 


iiUUXCuLlUC 






pep t ide 


QKfl IT) TOTI- in 
ocy xxj . xu 










sequence 


t»t"L nri t v 






sequence 






appl I ca t i on 




742 


2528 


4314 


6100 


784CIP2B 426 


6626 


743 


2529 


4315 


6101 


784CIP2B 427 


6630 


744 


2530 


4316 


6102 


784CIP2B 428 


6631 


745 


2531 


4317 


6103 


784CIP2B 429 


6632 


746 


2532 


4318 


6104 


784CIP2B 430 


6633 


747 


2533 


4319 


6105 


784CIP2B 431 


6634 


748 


2534 


4320 


6106 


784CIP2B 432 


6638 


749 


2535 


4321 


6107 


784CIP2B 433 


6641 


750 


2536 


4322 


6108 


784CIP2B 434 


6644 


751 


2537 




6109 


784PIP9H 43^ 


6646 


752 




4324 


6110 


7A4f*TP7ft 4"*fC 




753 


2539 


4"^ 7*5 


6111 


7R.4r*TP?tt 4"?7 


6652 


/34 


254 0 




6112 


TflAfTDon Ata a 




ICC 


2541 


4327 


6113 


7ll4PTP9n A1Q 
f OHV~Xtr Ati 4J 3 


ccc7 


TEC 

/SO 




Alio 


dt 1 A 






/3 / 






oj. J.r> 


7ll4rTPOTl AA1 




758 


2544 


4330 


cue 

t> X JLO- 


THAf'TDTO Ail*) 


6664 


■759 


2545 


4331 




f tt 4 L.X Jr /J B_4 4 3 


6668 


760 


2546 


4332 


6118 


/a4ulIr2B_444 


6669 


761 


2547 


4333 


6119 


71Iiir'TDTII A A C 


CC71 
DO / J 


762 


2548 


4334 


6120 






763 


2549 


4335 


6121 


/H4L1P2B 447 


6687 


. _ 


2550 


4336 


6122 


7djir , TD'>tj it ji a 
/tf 4LJ.Jr2Jtl 44o 


6689 


765 










6693 


/DO 


2552 


4338 


6124 




6698 


767 


2553 


4339 


6125 


iaiir>TniD Act 
784UIP2B 451 


6b99 


76B 


2554 


4340 


6126 


/B4C1P2B 452 


6705 


769 


2555 


4341 


6127 


/ o4LJLi*2IJ_45J 


6711 


770 


2556 


4342 


6128 


/ tf4LJ.ir AO 454 


6713 


771 


2557 


4343 


6129 


/o*H_J.fc'AH 455 


6716 


772 




aTaa 


blJU 


/i»4UJ.t*-5l> 45b 


ctoc 

D /ZZ> 


I Til 




A1AC 
4J45 


6131 


/ Q%K+XlfZD l *Z> f 




774 




2560 


A"* AC 

4 J4b 




f O^^XirjiD 'ADO 


D / A / 


/ /-> 


2561 


4347 


fit 

o JL1J 


7fl4r , TP7rl ACQ 


StTS 


/ / 1> 




444D 








777 




4349 


6135 


/ O ^ \_ -L IT j£ ^Ol 


6730 


*7*7fl 

/ /o 


2564 


4350 


6136 


TttAPTPOP 4 CO 


6732 






4351 


6137 


7fl4f*IP7R 4fiH 


6733 


780 


2566 


4352 


6138 


784CTP2B 4 £4 

/cub ii jr^x? noil 


6737 


781 


2567 


4353 


6139 


7S4PTP9R 4fC | > 


6745 


782 


2568 


4354 


6140 


784CIP2B 466 


6751 


783 


2569 


4355 


6141 ! 


784CIP2B 467 


6754 


784 


2570 


4356 


6142 


7B4CIP2B 468 


6758 


785 


2571 


4357 


6143 


784CIP2B 469 


6761 


786 


2572 


4358 


6144 


784CIP2B 470 


6765 


787 


2573 


4359 


6145 


784CIP2B 471 


6768 


788 


2574 


4360 


6146 


784CIP2B_472 


6773 


789 


2375 


4361 


6147 


784CIP2B 473 , 


6776 


790 


2576 


4362 


6148 


784CIP2B_474 


6796 


791 


2577 


4363 


6149 


784CIP2B__475 


6798 


792 


.2578 


4364 


6150 


784CIP2B 476 


6823 


793 


2579 


4365 


6151 


784CIP2B 477 


6825 


794 


2580 


4366 


6152 


784CIP2B_478 


6826 


795 


2581 


4367 


6153 


784CIP2B 479 


6839 


796 


2582 


4368 


6154 


7B4CIP2B_480 


6844 


797 


2583 


4369 


6155 


784CIP2B482 


6849 


798 


2584 


4370 


6156 


784CIP2B_483 


6854 


799 


2585 


4371 


6157 


784CIP2B_484 


6857 


800 


2586 


4372 


6158 | 


784CIP2B_485 


6861 


801 


2587 


4373 


6159 


784CIP2B 486 


6873 


802 


2588 


4374 


6160 


784CIP2B_487 


6875 


803 


2589 


4375 


6161 


784CIP2B_488 


6877 
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CRTl Tn MO • 
isdy XLt yi\J 1 


SEQ ID 


Priority 


SBQ ID 


of full- 


NO: of 


of can t" i rf 


NO: 


oulkcl nuiriDer 


NO : in 


length. 


full- 


nucleot ide 


o£ con tig 


^— -1- A GC7£jUlLwlXlIw 


U S 5 N. 


nucleotide 


length 


sequence 


peptide 


SBQ ID NO: in 


09/488 725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




604 


2590 


4376 


6162 


784CIP2B 489 


6880 


805 


259X 


4377 


6163 


784CIP2B 490 


6885 


806 


2592 


4378 


6164 


784CIP2B_491 


6890 


807 


2593 


4379 


6165 


784CIP2B 492 


6890 


[ 808 


2594 


4380 


6166 


784CIP2B 493 


6894 


809 


2595 


4381 


6167 


784CIP2B 494 


6901 


810 


2596 


4382 


6168 


784CIP2B 495 


6904 


811 


2597 


4383 


| 6169 


784CIP2B 496 


6907 


! 812 


259B 


4384 


6170 


7B4CIP2B 497 


6914 


813 


2599 


4385 


6171 


784CIP2B 498 


6917 


814 


2600 


4386 


6172 


784CIP2B 499 


6923 


815 


2601 


4387 


6173 


784CIP2B 500 


6929 


816 


2602 


4388 


6174 


7B4CIP2B Rm 


6931 


817 


2603 


4389 


6175 


7B4PTP2R ^fi? 


6935 


818 


2604 


4390 


6176 


7B4CTP2R «^nt 




819 


2605 


4391 


1 6177 


784PTP2R miA 


6945 


820 


2606 


4392 


6178 




6946 


821 


2607 


4393 


6 179 


7flAfTDOH cnC 


6947 


822 


2608 




6180 




6949 


823 


2609 


4395 


6181 


/ O ** \<ijr6D 


6959 


824 


2610 


4396 


6182 


7H4PTD9H Cnq 


6960 


825 


/oil 


* j;* / 


Hi ft A 




6962 


826 


2612 


4398 


6184 


/Ol V-lr^sD 311 


D70J 


827 


2613 


4399 


6185 




O70 / 


828 


2614 


4400 






b y a j 


829 


2615 


4401 


CI D»7 


1 o 4 ^ XirAD^p X 9 


6988 


830 


2616 


A A no 


OJLH tj 


ro^^.-lJt'^sJo 3Xi> 


6996 


831 


" 2617 


a a m 




ro4V«JLK*£B bib 


7003 


832 


2618 


4404 




7B4f 1 TD9R ci -> 


/DID 


833 


2619 






7BAPTO")B ci g 
/o4Llf j£D MO 


7017 


834 




4406 '"' 


" g i9 2" 


/OiLlrZc dIj 




835 


2621 


4407 


6193 




7025 


836 


2622 


4408 


6194 


7fl4f*TP?B S21 


7025 


837 


2623 


4409 


6195 


7R4r , TPPn ^77 
/ O^t LpIIT&O 




838 


2624 


4410 


6196 


7R4r*TD3P Ek7T 


7051 


839 


2625 


4411 


6197 


7fl4PTPPft i;9i 


7055 


840 


2626 


4412 


6198 


784CIP7B 


7060 


841 


2627 


4413 


6199 


7R4CTP2B *»2fi 


7064 


842 


2628 


4414 


6200 


784CIP2B 527 


70^7 


843 


2629 


4415 


6201 


784CIP2B 528 


7071 


844 


2630 


4416 


6202 


784CIP2B 529 


7072 


845 


2631 


4417 


6203 


784CIP2B 530 


7073 


846 


2632 


4418 


6204 


784CIP2B 531 


707£ 


847 


2633 


4419 


6205 


784CIP2B 532 


7088 


848 


2634 


4420 


6206 


784CIP2B 533 


7089 


849 


2635 


4421 


6207 


7B4CIP2B__534 


7091 


650 


2636 


4422 


6208 


7B4CIP2B_535 


7091 


851 


2637 


4423 


6209 


784CIP2B 536 


7104 


"852 


2638 


4424 


6210 


784CIP2B537 


7105 


853 


2639 


4425 


6211 


784CIP2B_538 


7105 


854 


2640 


4426 


6212 


784CIP2B_539 


7109 


855 


2641 


4427 


6213 


784CIP2B __540 


7109 


856 


2642 


4428 


6214 


784CIP2B 541 


7119 


857 


2643 


4429 


6215 


784CIP2B_542 


7120 


856 


2644 


4430 


6216 


784CIP2B_543 


7121 j 


859 


2645 


4431 


6217 


784CIP2B_544 


7126 


660 


" 2646^ 


4432 


6218 


784CIP2B 545 


7127 


861 


2647 


4433 


6219 


784CIP2B_546 


7130 


862 


2648 


4434 


6220 


784CIP2B 547 


7131 


863 


2649 


4435 


6221 


784CIP2B 548 


7144 


864 ! 


2650 


4436 


6222 


784CIP2B 549 


7159 


865 


2651 


4437 


6223 


784CIP2B 550 


7163 
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oty UJ 




SEQ ID 


rx lu l 1 uy 




of ful 1 - 


NO : of 




NO: 




NO: in 


length 


full- 


nucleotide 


of con tig 


corresponding 


U.S. S.N. 


nuc 1 e o t i de 


length 


sequence 


peptide 


SBQ ID NO: in 


09/488, 725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




j 866 


2652 


4438 


6224 


784CIP2B 551 


7175 


867 


2653 


4439 


6225 


784CIP2B_552 


7188 


868 


2654 


4440 


6226 


784CIP2B_5S3 


7189 


869 


2655 


4441 


6227 


784CIP2B_554 


7190 


870 


2656 


4442 


6228 


784CIP2B_555 


7191 


871 


2657 


4443 


6229 


784CIP2B_556 


7203 


872 


2658 


4444 


6230 


784CIP2B_557 


7204 


873 


2659 


4445 


6231 


784CIP2B_558 


7208 


874 


2660 


4446 


6232 


784CIP2B_559 


7209 


875 


2661 


4447 


6233 


784CIP2B 560 


7210 


876 


2662 


4448 


6234 


784CIP2B 561 


7216 


877 


2663 


4449 


6235 


784CIP2B 562 


7221 


878 


2664 


4450 


6236 


784CIP2B 563 


7230 


879 


2665 


4451 


6237 


784CIP2B 564 


7237 


880 


2666 


4452 


6238 


784CIP2B S65 


7240 


881 


2667 


4453 


6239 


784CIP2B 566 


7245 


882 


2668 


4454 


6240 


7B4CIP2B 567 


7250 


883 


2669 


4455 


6241 


784CIP2B 568 


7251 


884 


26 70 


4456 


6242 


784CXP2B 569 


7255^ 


□ OC 
003 


2671 


4457 




7ft4rrP9R 5*7n 


7260 


886' 


2672 


4458 


6244 


7R4f*TP2B 571 


7265 


887 


<o to 




6245 


7fliniP2B 572 


7268 


floa 
ooo 


2674 


4460 


6246 


7B4CXP2B 573 


7275 






"4461 


g247 ■ 


7B4CTP2B 574 


7279 


890 


2676 


4462 


6248 


7B4CIP2B 575 


7283 


O J* J. 


2677 




6249 




7283 




2678 


4464 


6250 


784CXF2B 577 


7287 


893 


"2T79 


4465 


6251 


784CIP2B 578 


7301 


894 


2680 


4466 


6252 


784CIP2B 579 


7308 


895 


2681 


4467 


6253 


784CIP2B 580 


7308 


896 


2682 


4468 


6254 


784CIP2B 581 


7309 


897 


26*83 


4469 


6255 


784CIP2B 582 


7319 


898 


2684 


4470 


6256 


" 784CIP2B 583 


7320 


899 


2685 


4471 


6257 


784CIP2B 584 


7326 


900 


2686 


4472 


5258 


784CIP2B 585 


7326 


901 


2687 


4473 


6259 


784CXP2B 586 


7334 


902 


2688 


4474 


6260 


784CIP2B 587 


7337 


903 


2689 


4475 


6261 


784CIP2B 588 


7339 


904 


2690 


4476 


6262 


784CIP2B__589 


7344 


905 


2691 


4477 


'6263 


784CIP2B_590 


7355 


906 


2692 


4478 


6~2£4 


784CIP2B_591 


7363 


907 


2693 


4479 


6265 


784CIP2BJ592 


7363 


908 


.2694 


4480 


6266 


784CIP2B593 


7365 


909 


2695 


4481 


6267 


784CIP2B_594 


7368 


910 


2696 


4482 


626$ 


7846tP2B_595 


7369 


911 


2697 


4483 


6269 


784CIP2B_596 


7372 


912 


2698 


4484 


6270 


784CIP2B_599 


7375 


913 


2699 


4485 


6271 


784CIP2B600 


7381 


914 


2700 


4486 


6272 


784CIP2B 601 


7383 


915 


2701 


4487 


6273 


784CIP2B602 


7387 


916 


2702 


4488 


6274 


784CIP2B_6 03 


7391 


917 


2703 


4489 


6275 


784CIP2B_604 


7393 


918 


2704 


4490 


6276 


784CIP2B_605 


7395 


919 


2705 


4491 


6277 


7B4CIP2B_606 


7397 


920 


2706 


4492 


6278 


7B4CIP2B 607 


7399 


921 


2707 


4493 


6279 


784CIP2B 608 


7405 


922 


270 B 


4494 


6280 


784CIP2B 609 


" 7406" 


923 


2709 


4495 


6281 


7B4CIP2B_610 


7406 


924 


2710 


4496 


6282 


784CIP2B_611 


7409 


925 


2711 


4497 


6283 


784CIP2B_612 


7410 


926 


2712 


4498 


6-284 


7B4CIP2B_613 


7411 


927 


2713 


4499 


6285 


784CIP2B 614 


7417 
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or*.IJ J.D viKJ ' 


SEQ ID 


S3Q ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO- of 


of confcicj 




docket number^ 


NO: in 


length 


full- 


•>»!»— "J t Ada. 

uuvicubi Lie; 








nucl eo t ide 


length 




peptide 


SEO ID NO« in 


OQ/dHft 72«; 


sequence 


peptide 




sequence 


priority 






sequence 






app 1 ica t ion 




528 


2714 


4500 


6286 


784CIP2B 615 


7418 


929 


2715 


4501 


6287 


784CIP23 616 


7421 


930 


2716 


4502 


6288 


784CIP2B 617 


7422 


931 


2717 


4503 


6289 


784CIP23 618 


7422 


932 


2718 


4504 


6290 


784CIP2B 619 


7423 


933 


2719 


4505 


6291 


784CIP23 620 


7424 


934 


2720 


4506 


6292 


784CIP?3 621 


74 2 6 


935 


2721 


4507 


6293 


784ClP2 T i 622 


7427 


936 


2722 


4508 


6294 


7B4CIP2B 623 


7428 


937 


2723 


4509 


6295 


7R4PTP2R #»74 


7430 


938 


2724 


45ld 


6296 


7HAf ,1 rp2a 

/ g^^irJIo O ^ U 


7435 


939 


2725 


ACi i 


XHi? / 






940 


2726 


4512 


6298 






941 


2727 


4513 




/04V»J.tr2t* t>Z D 


7440 


942 


2728 


acta 




/o4l~AJrj£ii b2!# 


7442 


943 




4515 


6301 


/ o 4 1.1 P2 B__o3 0 


7450 


944 


2730 


4516 


6302 




7451 


QA c. 


o*in 

1 J JL 


4517 


6303 




7452 


QAC 


2732 


4518 


6304 


784CIP23_633 


" nAdA 

7454 


94 V 


2733 


4519 


6305 


/84C1P2B ©34 


7457 


948 


2734 


4520 


6306 


7B4CIP2B_635 


7459 


QA Ck 


273S 


4521 


6307 


7B4CIP2B__63o 


7461 


95 0 


A /ib 


4522 


6308 


784CIP2B_G37 


7463 


bci 


2737 


4523 


6309 


r84CIP2B 638 


7466 




2738 


AC") A 


6310 


/84CIP2B__633 


7469 






4525 


6311 


784CiP2B_o4 0 


7473 






It5c 

4=>26 


cno 

6312 


/H4C1P2B_541 


j 7481 


qtii 





Ag'*3T " 


ciii 


/84 CIP2B__64 2 


74B2 




2742 


4529 


6314 


784CIP2B_643 


7482 


OCT 


2743 


4529 


6315 


/H4C.1P2B_644 


7483 


958 


2744 




bJlb 


/ o 4 X Yd. B__64 b 




959 


2745 


*ir>j jl 


em 7 






960 


2746 




D-5 iff 


no A Q CA1 


7487 


961 




4533 


6319 




7491 


962 


2748 






/ a 4L1 F2i»__©4 3 


7492 


963 


2749 










964 


27$0 


4536 


6322 


/o4V^J.lrjfcJ9 obi 


7498 


965 


2751 


4537 


6323 


/ CJ*H-. JL Jr/iJt* D3j£ 




966 


2752 


4538 


6324 




7CAQ i 
/DUO 


967 


2753 


4539 


6325 


* « *XV — l IT O IJ M 


7516 


96*8 


2754 




6326 




7518 


969 


2755 


4541 


6327 


784CIP2B ^SG 


7519 


970 


2756 


4542 


6328 


784CIP2B 657 


7521 


971 


2757 


4543 


6329 


784CIP23 658 


7529 


972 


2758 


4544 


6330 


784CIP2B 659 


7532 


973 


2759 


4545 


6331 


784CIP23 660 


7533 


974 


2760 


4546 


6332 


784CIP2B 661 


7535 


975 


2751 


4547 


6333 


784CIP2B 662 


7545 


976 


2762 


4548 


6334 


784CrP2B_663 


7546 


977 


2763 


4549 


6335 


784CIP2B 664 


7552 


978 


2764 


4550 


6336 


784CIP2B_665 


7554 


979 


2765 


4551 


6337 


784CIP2B 666 


7567 


980 


2766 


4552 


6338 


784CIP23_667 


7569 


981 


2767 


4553 


6339 


784CIP2B668 


7575 


982 


2768 


4554 


6340 


784CIP23_669 


7576 


983 


2759 


4555 


6341 


784CIP23_670 


7577 


984 


2770 


4556 


6342 


784CIP2B67X 


7579 


985 


2771 


4557 


6343 


784CIP23_672 | 


7582 


986 


2772 


4558 


6344 


784CIP2B_673 


7587 


987 


2773 


4559 


6345 


784C1P23 674 


7589 


988 


2774 


4560 


6346 


" 784CIP2B_675 


7597 


989 


2775 


4561 


6347 


784CIP2B_676 


7597 
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SBQ ID NO: 




HRrt TT\ Ton. 




Priori ty 


SEQ ID 


of full- 


NOr of 


ftf con }■ i or 


NO: 


vAuciveu numpejc 


NO: in 


length 


full- 


nucleotide 


of con tig 




U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SBQ ID NO: in 


09/488 735 


sequence 


peptide 




sequence 


priority 






sequence 






appl i ca t i on 




990 


2776 


[ 4562 


6348 


784CIP2B 677 


7609 




[ 2777 


4563 


6349 


784CIP2B 678 


7609 


992 


2778 


4564 


6350 


' 7B4CIP2B 679 


7609 


993 


2779 


4565 


" 6351 


784CIP2B 680 


7613 


| 994 


27B0 


4566 


6352 


784CIP23 681 


7623 


99S 


27B1 


4567 


6353 


784CIP23 682 


7629 


996 


2782 


4568 


6354 


784CIP2B 683 


7630 


997 


2783 


4569 


6355 


784CIP2B 684 


7633 


999 


2784 


4570 


6356 


784CIP2B 685 


7635 


999 


2785 


4571 


6357 


784CIP2B 686 


7638 


1000 


2786 


4572 


6358 


784CTP7P* G~A7 


7639 


1001 


2787 


4573 


6359 


784CIP5B Cftfl 

# O tVXE j£.D DOO 




1002 


2788 


4574 


6360 


784CTP7TI fiftQ 


7647 


1003 


2789 


4575 


6361 


784CTP7R fiO,fl 


k%a n 


1004 


2790 


4576 


6362 


7ftAr'Ti»on cot 

» CI ft Llr/D 02#JL 


rOoo [ 


1005 


2791 


4577 


6363 


7ftdr*TP!>n fIOt> 


7664 


1006 


2792 


4578 


6364 


7fldPTD')n CQ'l 


"7££ A 


1007 


2793 


4579 


6365 




7674 


1008 


2794 


4580 


DODO 


/ t>%L.JLJr.«ip t>3b 


7675 


1009 


2795 


4581 


636*7 




7676 


1010 


2796 


4582 


6368 


784CIP2B 69R 


7681 


1011 


2797 


4583 


i 6369 


"JBifTDnn coo 


7688 


1012 


2798 


4584 


oo ryf 


70* /"•T DTD Tnn 
t OH.L.±krAt> /UU 


7693 


1013 


2799 


4585 


6371 




7694 


10 iJT ~ 


2800 


45"86 


6*372 




7715 


1015 


2801 


4587 


6373 


7(14f'TD')n *7M 

/ 0 4 Jr <s Ja^_ / U J 


7716 


1016 


2802 


4588 






7718 


1017 


2803 


45B9 


6375 


T n A r»TT>^3 P.' Trie 


•7*701 


1018 


2804 


4E>90 


6376 


*7 RAPT POP *7fltt 




1019 


2805 


4591 


6377 


TftAPTDOt* "7 r\"7 
/o«LA*VD /U / 


7729 


1020 


2806 


4592 


6378 


/ Oflk L.Xii!D / I/O 


— 77T1 


1021 


2807 


4593 


6379 




/ / 


1022 


2808 


4594 


6380 




T7A 1 


1023 


2809 


4595 


6381 


7fl4PTP9R 711 


7743 


1024 


2810 


4596 


63B2 


*7HAPT©OT* "71 ? 
/01V.A1taX) f X*. 


"7*TilQ 


1025 


2811 


4597 


6383 


/ o«v.lr,£D »Xj 


1 t*kir 


: 1026 


2812 


4598 


6384 


784fIP*>'R 714 * 


7750 


1027 


2813 


4599 


6385 


7B4CTP^>P. ?m 




1028 


2814 


4600 


6386 


784CIP2B 716 


7759 


1029 


2815 


4601 


6387 


784CIP2B 717 


7760 


1030 


2816 


4602 


6388 


784CIP2B 718 


7760 


1031 


2817 


4603 


6389 


784CIP2B 719 


7764 


1032 


2818 


4604 


6390 


784CIP2B 720 


7765 


1033 


2819 


4605 


6391 


7B4CIP2B 721 


7756 


1034 


2820 


4606 


6392 


784CIP2B 722 


7767 


1035 


2B21 


4607 


6393 


784CIP2B 723 


7769 


1036 


2822 


4608 


6394 


784CIP2B 724 


7770 


1037 


2823 


4609 


6395 


784CIP2B_725 


7774 j 


1038 


2824 


4610 


6396 


784CIP2B 726 


7779 


1039 


2825 


4611 


6397 


784CIP2B_727 


77B1 


1040 


2826 


4612 | 


6398 


784CIP2B 728 


77B2 


1041 


2827 


4613 


6399 


784CIP2B_729 


7783 


1042 


2828 


4614 


6400 


784CIP2B 730 


7787 


1043 


2829 


4*15 


6401 


784CIP2B 731 


7792 


1044 


2830 


4616 


6402 


7B4CIP2B 732 


7795 


1045 


2831 


4617 


6403 


784CIP2BJ733 


7801 


1046 


2832 


4*18 


6404 


7B4CIP2B 734 


7807 


1047 


2B33 


4619 


6405 


784CIP23735 


7808 


1048 


2834 


4620 


6406 


784CIP23_736 


7819 


1049 


2835 


4621 


6407 


784CIP2B 737 


7824 


1050 


2836 


4622 


6408 


784CIP2B 738 


7826 


1051 


2837 


4623 i 


6409 


784CIP2B739 | 


7829 
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SEQ ID NO: 
of full- 
length 
nucleotide 
sequence 


SEQ. tD 
NO; of 
full- 
length 
peptide . 
sequence 


"SEQ ID NO: 
of con tig 
nucleotide 
sequence 


SEQ ID 
NO: 

of contig 

peptide 

sequence 


Priority 
docket number 
cor respond i ng 
SEQ ID NO: in 
priority- 
application 


SEQ ID 
NO; in 
U.S. S.N. 
09/488,725 


1052 


2838 


4624 


6410 


784CIP2B 740 


7832 


1053 


2839 


4625 


6411 


784CIP2B_741 


7839 


1054 


2840 


4626 


6412 


784CIP2B 743 


7847 


1055 


2841 


4627 


6413 


784CIP2BJ744 


7848 


1056 


2847. 


4628 


6414 


784CIP2B745 


7B53 


1057 


2843 


4629 


6415 


784CIP2B 746 


7854 


10S8 


2844 


4630 


6416 


784CIP2B_747 


7856 


1053 


2845 


4631 


6417 


784CIP2B_748 


7862 


1060 


[ 2846 


4632 


6418 


784CIP2B 749 


7865 


1061 


2847 


4633 


| 6419 


784CIP2B750 


7874 


1002 


2848 


4634 


6420 


784CIP2B 751 


7877 


1063 


2849 


4635 


6421 


7B4CIP2BJ7S2 


7880 


1064 


2850 


4636 


6422 


784CIP2B 753 


7882 


1065 


2851 


4637 


6423 


784CIP2B 754 


7884 


1066 


2852 


4638 


6424 


784CIP2BJ755 


7886 


1067 


2853 


4639 


6425 


784CIP2B_756 


7888 


1068 


2854 


4640 


6426 


784CIP2BJ757 


7889 


1069 


2855" 


[ 4641 


6427 


784CIP2B758 


7901 


1070 


2856 


4642 


6428 


7B4CIP2B_759 


7910 


1071 


2857 


4643 


6429 


784CIP2B 760 


7911 


1072 - 


2858 


4644 


6430 


784CIP2B_761 


7921 


1073 


2859 


4645 


6431 


7B4CIP2B 762 


7923 


1074 


2860 


4646 


6432 


7B4CIP2B 763 


7924 


1075 


2861 


4647 


6433 


784CIP2B_764 


7925 


1076 


2862 


4648 


6434 


7B4CIP2B 765 


j 7928 


1077 


2863 


4649 


6435 


784CIP2B 766 


7929 


107B 


2864 


4650 


6436 "" 


784CIP2B 767 


7930 


1079 


2865 


4651 


6437 


784CIP2B 768 


7934 


1080 


2866 


4652 


6438 


7B4CIP2BJ769 


793B 


1081 


2367 


4653 


6439 


784CIP2B 770 


7942 


1062 


2868 


4654 


6440 


784CIP2B 771 


7945 


10B3 


2869 


4655 


6441 


7B4CIP2B_772 


7946 


10B4 


2870 


4656 


6442 


784CIP2B_773 


7948 


10B5 


2871 


4657 


6443 


784CIP2B 774 


7951 


10B6 


2872 


4658 


6444 


784CIP2BJ775 


7952 


1087 


2873 


4659 


6445 


784CIP2B_776 


7953 


1088 


2874 


4660 


6446 


7B4CIP2B_777 


7954 


1089 


2875 


4661 


6447 


7B4CIP2BJ77B 


7957 


1090 


2876 


4662 


6448 


784dP2B_779 


7958 


1091 


2877 


4663 


6449 


784CIP2B_780 


7961 


1092 


2878 


4664 


6450 


784CIP2B_7B1 


7965 


j 1093 


2879 


4665 


6451 


784CIP2B7B2 


7966 


1094 


2880 


4666 


6452 


784CIP2B_7B3 


7979 


1095 


2881 


4667 


6453 


784CIP2B 784 


7986 


1096 


2882 


4668 


6454 


784CIP2B 785 


7986 


1097 


2BB3 


4669 


6455 


784CIP2B_786 


7988 


1098 


2884 | 


4670 


6456 


784CIP2B_787 


7991 


1099 


2885 


4671 


6457 


784CIP2B 788 


7992 


1100 


2886 


4672 


6458 


784CIP2B789 


7992 


1101 


2887 


4673 


6459 


784CIP2B 790 


7992 


1102 


288B 


4674 


6460 


784CIP2B_791 


7992 


1103 


2B89 


4675 


6461 


784CIP2B 792 


8003 


1104 



2890 


4676 


6462 


784CIP2B_793 


8014 


1105 


2891 


4677 


6463 


784CIP2BJ794 


8015 


1106 


2892 


4678 


6464 


784CIP2B795 


8016 


1107 


2893 


4679 


"6465 


784CIP2B 796 


8017 


1108 


2894 


4680 


6466 


784CIP2B_797 


8019 


1109 


2895 


4681 


6467 


784CIP2B 798 


8020 


1110 


2896 


4682 


6468 


784CIP2B_799 


8022 


1111 


"2897 


4683 


6469 


784CIP2B 800 


8022 


1112 


2898 


4684 


6470 


784CIP2B B01 


8028 


1113 


2899 


4685 


6471 | 


784CIP2B 802 


8030 
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SEQ ID NO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


NO: 


docket number^ 


NO: in 


length 


full- 


nucleotide 


of contig 


corresponding 




nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 
sequence 




sequence 


priority 
application 




1114 


2900 


4666 


6472 


784CIP2B_803 


8038 


1115 


2901 


4687 


6473 


784CIP2B 804 


8042 


1116 


2902 


4688 


6474 


7B4CIP2B_B05 


8045 


1117 


2903 


4689 


6475 


784CIP2B_806 


8045 


1118 


2904 


4690 


6476 


784CIP2B_B07 


8046 


1119 


2905 


4691 


6477 


784CIP2B_808 


8047 


1120 


2906 


4692 


6478 


784CIP2B_B09 


8051 


1121 


2907 


4693 


6479 


784CIP2B 810 


8059 


1122 


2908 


4694 


6480 


784CIP2B_811 


6064 


1123 


2909 


469S 


6481 


784CIP2B_812 


B069 


1124 


2910 


4696 


6482 


784CIP2B_813 


8074 


1125 


2911 


4697 


6483 


784CIP2B_814 


8077 


1126 


2912 


4698 


6484 


7B4CIP2B_8l5 


8078 


1127 


■ 2913 


4699 


6485 


7B4CIP2B 816 


8079 


1128 


2914 


4700 


6486 


784CIP2B817 


8084 


1129 


2915 


4701 


6487 


784CIP2B 818 


8088 


1130 


291* 


4702 


6488 


7B4CIP2B819 


8090 


1131 


2917 


4703 


6489 


784CIP2B 820 


8091 


1132 


2918 


4704 


6490 


784CIP2B 821 


8099 


1133 


2919 


4705 


6491 


7B4CIP2B_822 


8099 


1134 


2920 


4706 


6492 


7B4CIP2B 823 


8100 


1135 


2921 


4707 


6493 


784CIP2B 824 


8102 


1136 


2922 


4708 


6494 


784CIP2B_825 


8103 


1137 


2923 


4709 


6495 


784CIP2B_826 


8103 


1138 


2924 


4710 


6496 


784CIP2B 827 


8104 


1139 


2925 


4711 


6497 


784CIP2B_82B 


8108 


1140 


2926 


4712 


6498 


784CIP2B_829 


8110 


1141 


2927 


4713 


6499 


784CIP2B_830 


8116 


1142 


292 B 


4714 


6500 


7B4CIP2B_B31 


8117 


1143 


2929 


4715 


6501 


7B4CIP2B_832 


8123 


1144 


2*30 


4716 


6502 


7B4CIP2B_833 


8130 


1145 


2931 


4717 


6503 


784CIP2B_834 


8130 


1146 


2932 


4 718 


6504 


784CIP2B_B35 


8143 


1147 


2933 


4719 


6505 


784CIP2B 836 


8143 


1148 


2934 


4720 


6506 


784CIP2B 837 


8154 


1149 


2935 


4721 


' 6507 1 


784CIP2B_838 


8155 


1150 


2936 


4722 


6508 


784CIP2B_839 


8162 


1151 


. 2937 


4723 


6509 


784CXP2B 840 


8163 


1152 


2938 


4724 


6510 


784CIP2B 841 


8172 


1153 


2939 


4725 


6511 


784CIP2B842 


8173 


11S4 


2940 


4726 


6512 


784CIP2B_843 


8179 


1155 


\ 2941 


4727 


6513 


784CIP2B_844 


8182 


1156 


2942 


4728 


6514 


784CIP2B 845 


8183 


1157 ■ 


2943 


4729 


6515 


784CIP2B_846 


8184 


1158 


2944 


4730 


6516 


784CIP2B_847 


8185 


1159 


2945 


4731 


6517 


784CXP2B w 848 


8187 


1160 


2946 


4732 


6518 


784CIP2B 849 


8188 


1161 


2947 


4733 


6519 


784CIP2B_B50 


8190 


1162 


294 8 


4734 


6520 


784CIP2B 851 


| 8190 


1163 


2949 


4735 


6521 


784CIP2BB52 


8192 


1164 


2950 


4736 


6522 


784CIP2B_853 


8193 


1165 


2951 


4737 


6523 


7B4CIP2B854 


8197 


1166 


2952 


4738 


6524 


784CIP2B_855 


8197 


1167 


2953 


4739 


6525 


784CIP2B_856 


8199 


1168 


2954 


4740 


6526 


7B4CIP2B857 


8202 


1169 


2955 


4741 


6527 


784CIP2BB58 


8203 


1170 


2956 


4742 


6528 


7B4CIP2B_859 


8208 


1171 


2957 


4743 


. 6529 


784CIP2B 860 


8209 


1172 


2958 


4744 


6530 


784CIP2B861 


8211 


1173 


2959 


4745 


6531 


784CIP2B 862 


8214 


1174 


2960 


4746 


6532 


784CIP2B 063 


8217 


1175 


2961 


4747 


6533 


784CIP2B 864 


8223 
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SEQ ID NO i 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


HO: 


docket number^ 


NO: in 


length, 
nucleotide 


tull- 
length 


nucleotide 
sequence 


ot concig 

peptide. 


corresponding 
oKy id no: in 


U.S. S.N. 

f\ f-\ /i AO rtnr 

09/4B8, 725 




pepciae 

"V. ^UCl IV. 




b e c[ucuc e 


priority 

^innl "i r^i^i ^~ 4 rvn 




1176 


.2962 


4748 


6534 


7R4("*TP?n RfiS 


8224 


1177 


2963 


4749 


6535 


/ o'i^xir its ODD 


QT>(T 


1178 


2964 


4750 


6536 


7R4PTP?ft HC7 


u**j # 


1179 


2965 


4751 


6537 


f 0 _L Jr Z £3 ODD 


8229 


1180 




4752 


6538 




8232 


i 1181 


2967 




6539 


*7A4r*TDOii bin 


8236 


1182 


2968 


4754 


bMV 


/o^Llrjiti 0 / ± 


8239 


1183 




** for* 


6541 




8244 




2970 


4756 






8245 


1185 


2971 


4757 


6543 




8248 


1186 


2972 


a Vcb 

4758 


6544 


784CIP2B 875 


8251 


1187 


2973 


4759 


6545 


784CIP2B_87o 


8253 


1188 


2974 


4760 


6546 


784CXF2B 877 


8260 


1189 


2975 


4761 


6547 


784CIP2B_878 


8262 _ 


1190 


2976 


4762 


6546 


784CIP2B 879 


8268 


1191 


2977 


4763 


6549 


784CIP2B BB0 


8270 


1192 


2978 


4764 


6550 


784CIP2B_8B1 


8272 


1193 


2979 


4765 


6551 


784CIP2B_882 


8274 


1194 


2980 


4766 


6552 


784CIP2B_B83 


8274 


1195 


2981 


4767 


6553 


784CIP2B 884 


8275 


1196 


2982 


4768 


6554 


784CIP2B_885 


8277 


1197 


2983 


4769 


6555 


784CIP2B 886 


8281 


1198 


2984 


4770 


6556 


784CIP2B 887 


B283 


1199 


2985 


4771 


6557 


784CIP2B 888 


8289 


1200 


2986 


4772 


6558 


784CIP2B 889 


8295 


1201 


2987 


4773 


6559 


784CIP2B B90 


8300 


1202 


2988 


4774 


6560 


784CXP2B 891 


8303 


1203 


2989 


4775 


6561 


784CIP2B_892 


8304 


1204 


2990 


4776 


6562 


784CIP2B_893 


8305 


1205 


2991 


4777 


6563 


784CIP2B_894 


8309 


1206 


2992 


477B 


6564 


784CIP2B_895 


8318 


1207 


2993 


4779 


6565 


784 CIP2B__8 96 


8319 


1208 


2994 


4780 


6566 


784CIP2B__897 


8321 


1209 


2995 


4781 


6567 


784CIP2B__898 


8322 


1210 


2996 


4782 


6568 


784CIP2B_899 


8323 1 


1211 


2997 


4783 


6569 


784CIP2B_900 


8325 


1212 


2998 


4784 


6570 


784CIP2B__901 


8331 


1213 


2999 


4785 


6571 


7 84 CI P2B__9 0 2 


8332 


1214 


3000 


4786 


6572 


784CIPZB 903 


8333 


1215 


3001 


4787 


6573 




8335 




3002 


4768 


6574 


ioapttjod one: 


8336 


1217 


juuj 


4789 


6575 




8337 


1218 


30 04 


4790 


D3 / D 


/ ofliLlr 4£r> iru / 


8340 


1219 


3005 


4/91 


6577 




BMJ 


1220 


3006 


4792 




7R4r*TP2P. tJHQ 


8347 ' 


1221 


3007 


"± t yO 


op j y 






1222 


3008 


4794 


6580 


7R4rTP77l <11 1 


8351 


1223 


3 009 


4795 


6581 




8353 


1224 


3010 


4796 


6582 




8355 


1225 


3011 


4797 


6583 


7B4CTP2B 914 


8361 


1226 


3012 


4798 


6584 


# O** V— .Ltr -£.P 


OJOJ 


1227 


3013 


A'/qq 




•7RAPTDOP 91 £ 


Ojo / 


1228 


3014 


480O 


65B6 


784CIP2B_917 


8369 


1229 


3015 


4801 


6587 


784CIP2B_919 


8375 


1230 


3016 


4802 


6588 


784CIP2B_920 


8387 


1231 


3017 


4803 


• 6589 


784CIP2B_921 


8391 


1232 


3018 


4804 


6590 


7B4CIP2B922 


8393 


1233 


3019 


4805 


6591 


784CIP2B 923 


8393 


1234 


3020 


4806 


6592 


784CIP2B 924 


8394 


1235 


3021 


4807 


6593 


7B4CIP2B_925 


8395 


1236 


3022 


4808 


6594 


784CIP2B„926 


8396 


1237 


3023 


4809 


6595 


784CXP2B 927 


8398 
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SEQ ID NO: 


S BQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contlg 


ND: 


docket number^ 


NOrin 


length 


full- 


nucleotide 


of contig 


corre sponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


cequence 


peptide 
sequence 




sequence 


priority 
application 




1238 


3024 


4810 


6596 


7B4CIP2B_928 


8402 


1239 


3025 


4811 


6597 


784CIP2B_929 


8402 


1240 


3026 


4812 


6598 


784CIP2B_930 


8405 


1241 


3027 


4813 


6599 


784CIP2B_931 


8406 


1242 


3028 


4814 


6600 


784CIP2B_932 


8409 


1243 


3029 


4815 


6601 


7B4CIP2B_933 


8410 


1244 


3030 


4816 


6602 


784CIP2B_934 


6414 . 


1245 


3031 


4817 


6603 


784CIP2B935 


8415 


1246 


3032 


4818 


6604 


7B4CIP2B 936 


8419 


1247 


3033 


4819 


6605 


784CIP2B_937 


8426 


| 1248 


3034 


4820 


6606 


784CIP2B_938 


8430 


1249 


3035 


4821 


6607 


784CIP2B 939 


8431 


1250 


3036 


4822 


6608 


784CIP2B_940 


8432 


1251 


3037 


4823 


6609 


784CIP2B_941 


8433 


1252 


3038 


4824 


6610 


784CIP2B 942 


8434 


1253 


3039 


4B25 


6611 


784CIP2B_943 


8438 


1254 


3040 


4826 


6612 


7B4CIP2B_944 


8439 


1255 


3041 


4827 


6613 


784CIP2B_945 


6441 


1256 


3042 


4828 


6614 


784CIP2B_946 


8450 


1257 


3043 


4829 


6615 


7B4CIP2B_947 


8451 


1258 


3044 


4830 


6616 


7B4CIP2B„94B 


■ 8452 


1259 


3045 


4831 


6617 


784CIP2B_949 


8460 


1260 


3046 


4832 


6618 


784dP2B_950 


8461 


1261 


3047 


4833 


6619 


784CIP2B_951 


8462 


1262 


3048 


4834 


6620 


784CIP2B 952 


8464- 


1263 


3049 


4835 


6621 


784dP2B_953 


8465 


1264 


3050 


4836 


6622 


784CIP2B 954 


8467 


1265 


3051 


4 837 


6623 


784CIP2B_955 


8470 


1266 


3052 


4838 


6624 


784CIP2B_956 


8471 


1267 


3053 


4839 


6625 


784CIP2B_957 


8473 


1268 


3054 


4840 


6626 


784CIP2B_95B 


8474 


1269 


3055 


4 841 


6627 


784CIP2B 959 


6475 


1270 


3056 


4842 


6628 


784CIP2B 960 


8476 


1271 


3057 


4843 


6629 


7B4CIP2B961 


8480 


1272 


3058 


4844 


6630 


784CIP2B 962 


8482 


1273 


3059 


4845 


6631 


784CIP2B_963 


8482 


1274 


3060 


4B46 


6632 


784CIP2B964 


8486 


1275 


3061 


4847 


6633 


784CIP2B_965 


8488 


1276 


3062 


4848 


6634 


784CIP2B_966 


8492 


1277 


3063 


4849 


6635 


784CIP2B 967 


8494 


1278 


3064 


4850 


6636 


784CIP2B_968 


8406 


1279 


3065 


4851 


6637 


784CIP2B969 


8497 


1280 . 


3066 


4852 


6638 


784CIP2B_970 


8499 


1281 


3067 


4853 


6639 


784CIP2B_971 


8513 


1282 


3068 


4854 


6640 


784CIP2B_972 


6522 


1283 


3069 


4655 


6641 


784CIP2B973 


8526 


1284 


3070 


4856 


6642 


784CIP2B 974 


B531 


1285 


3071 


4857 


6643 


784CIP2B_975 


8533 


1286 


3072 


4858 


6644 


784CIP2B_976 


8542 


1287 


3073 


4859 


6645 


784CIP2B 977 


8544 


1280 


3074 


4860 


6646 


784CIP2B_978 


8565 


| 1289 


3075 


4861 


6647 


784CIP2B_979 


8565 


1290 


3076 


4862 


6648 


784CIP2B_980 


B572 


1291 


3077 


4863 


6649 


7B4CIP2B_981 


B576 


1292 


3078 


4864 


6650 


7B4CIP2B_982 


857B 


1293 


3079 


4865 


6651 


784CIP2B_983 


8584 


1294 


3080 


4666 


6652 


7B4CIP2B_984 


859B 


1295 


3081 


4867 


6653 


784CIP2B98S 


8602 


1296 


3082 


4868 


6654 


784CIP2B986 


8604 


1297 


3083 


4869 


6655 


784CIP2B_987 


8609 


1298 


3084 


4870 


6656 


784CIP2B_98B 


8612 


1299 


3085 


4871 


6657 


784CIP2B_989 


8637 
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SEQ ID NO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


NO: 


docket number^ 


NOrin 


length 

IlUCieOulUe 


XU11- 

length 


nucleotide 
sequence 


of contig 
peptide 


corresponding 

t»CAJ LLt H\J ' XXV 


n.s .s.n. 

no / A a a lie 

09/488 , 725 




s & cm pnc'f* 






piioncy 

;»tvt\1 4 r*;it* "i f\n 

ct^JJ^J. XUrtUiUU 




Dob 


3086 


4872 


6658 


784CIP9R 9Qn 


8640 


1301 


3087 


4B73 


6659 


7ft4f*TP7R qqi 




1302 


3088 


4874 


6660 


7ft4rTP9H QQ1 


8645 i 


1303 


3089 


4875 


6661 


7B4r"TP9R 


8650 


1304 


3090 


4876 


6662 




6651 


J. J no 


3091 


4877 


6663 




8654 


1306 


3 092 


4878 


6664 


/ O'tV^JLx'.Z.D 


8655 


130*/ 


3093 


4879 


6665 




0CC7 


n inn 


""10 51" """" 




oobb 


/ d K — Lr jjo 


8665 


13 09 


3095 


4881 


6667 


7B4t-XP2B_939 


8668 


1310 


3096 


4882 


6668 


f84CIP2B 1000 


8671 


1311 


3097 


4883 


6669 


784CIP2B__1001 


8672 


1312 


3098 


4884 


6670 


784CI P2B_10 02 


8692 


1313 


3099 


4885 


6671 


784CIP23_1003 


87C6 


1314 


3100 


4886 


6672 


784CIP23_1004 


8716 


1315 


3101 


4887 


6673 


784CIP2B 1005 


8719 


1316 


3102 


4BB8 


6674 


784CIP2B_1006 


B743 


1317 


3103 


4889 


6675 


784CIP2B_1007 


B764 


1318 


3104 


4890 


6676 


784CIP2B_1.008 


8764 


1319 


3105 


4891 


6677 


784CIP2B_10 09 


8764 


1320 


3106 


4892 


6678 


784CIP2B_1010 


8774 


1321 


3107 


4893 


6679 


784CIP2B_1011 


8782 


1322 


3108 


4894 


6680 


784CIP2B_1012 


8796 


1323 


3109 


4895 


6681 


7 84 CI P2B_1 01 3 


8827 


1324 


3110 


4896 


66B2 


784CIP2B__10 14 


8842 


132S 


3111 


4897 


6683 


784CIP2B_1015 


8842 


1326 


3112 


4898 


6684 


784CIP2B_1016 


8858 


1327 


3113 


4899 


6685 


7 B4CI P2B_1Q17 


8871 


1328 


3114 


4900 


6686 


784CIP2B_1018 


8921 


1329 


3115 


4901 


6607 


784CIP2B — 1019 


8927 


1330 


3116 


4902 


6688 


784CIP2B_1020 


8942 


1331 


3117 


4903 


6689 


7B4CIP2B_JL021 


8994 


1332 


3118 


4904 


6690 


784CIP2B_3.022 


9023 


1133 


3119 


4905 


6691 


784CIP2B_1023 


9028 


1334 


3120 


4906 


6692 


784CIP2B_1024 


9058 


1335 


3121 


4907 


6693 


784CIP2B_JL025 


9058 


1336 


3122 


4908 


6694 


7B4CIP2B 1026 


9079 


1337 


3123 


4909 


bb 95 


784CX.fc , 2B 1027 


9079 


1 HQ 






bt>y x> 


/ all — IJr^iJ JLUZo 


9082 


1 T*0 




4911 


6697 


/ 1> 4 U Air i o^_± 02 9 


90B4 




own 


AQ1 *> 


CCQQ 

oo jo 


I o'kK^Xxfd.a luiU 


9093 


1341 


3127 


»r*4, j 


ot> w 


f0^l_JlJrZtt 1UJX 




1342 


3126 


4914 


6700 




9103 


1343 


3129 


4915 


6701 


1 at K-X.tr 4,D JLUjJ 


9105 


1344 


3130 


4916 


O f 


7R4rTP9H 1 fl"*4 


9151 


1345 


3131 


4917 


6703 


7B4CIPPB "LQ"15 


9161 


1346 


3132 


4918 


6704 


784CIP2B 1036 


9172 


1347 


3133 


4919 


6705 


784CIP2B 1037 


9174 


1348 


3134 


4920 


6706 


7B4CTP2B 1D3B 


9204 


1349 


3135 


4921 


6707 


784CIP2B 1039 


9234 


1350 


3136 


4922 


6708 


784CIP2B 1040 


9235 


1351 


3137 


4923 


6709 


7B4CIP7B 1041 1 


9239 


1352 


3138 


4924 


6710 


784CIP2B1042 


9256 


1353 


3139 


4925 


6711 


784CIP2B1043 


9276 


1354 


3140 


4926 


6712 


784CIP2B_1044 


9345 


1355 


3141 


4927 


6713 


7B4CIP2BJLQ45 


9379 i 


1356 


3142 ; 


4928 


6714 


7B4CIP2B_1046 


9435 


1357 


3143 


4929 


6715 


784CIP2B1047 


9437 


1358 


3144 


4930 


6716 


7B4CIP2B 1048 


9469 


1359 


3145 


4931 


6717 


784CIP2B 1049 


9500 


1360 


3146 


4932 


6718 


784CIP2B 1050 


9502 


1361 


3147 


4933 


6719 


784CiP2B 10"51 


9520 
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f~% ryf~\ Wf^> • 


SBQ ID 


SBQ ID NO: 


SBQ ID 


Priority 


SBQ ID 


OI EUXJL-* 


NO: of 


of contig 


NO: 


docket number^ 


NO: in 


JLtJIly V-Il 


■fn 11- 


n uc I eo t ide 


ot concxej 


corxeeponding 


U. S.S.N. 




1 e*Ttrrt-h* 

AdlUJ 1,11 






OAU XX/ HU. XJ1 


no /«oo iic 


sequence 


xf It **- *- L ^ 






*-\>- i rty-J |*-»f 
Jfi. 1U1 JL LJ 






sequence 






amplication 




1362 


3148 


4934 


6720 


784CIP2B 1052 


9541 


1363 


3149 


4935 


6721 


7B4CIP2B 1053 


9541 


1364 


3150 


4936 


6722 


784CIP2B 1 0S4 


9548 


1365 


3151 


4937 


6723 


7B4CIP2B 1055 


9556 


1366 


3152 


4938 


6724 


784CIP2B 10^6 


9556 


1367 


3153 


4939 


6725 


784CIP2B 1057 


9575 


136B 


3154 


4940 


6726 


7A4f*TDTR 1 n^R 


QRflCt 

y Do? 


1369 


3155 


4941 


6727 




9599 


1370 


3156 


4942 


6728 


7ftAPTDon i nen 


9602 


1371 


3157 




o 


/o^LlrZiJ XUoX 


9606 


1372 


3158 




O /->U 


TftArroon 1 no 
fo4^1r JiJS lUbz 


9622 






4945 


6731 


/ o 4 L. JL P 2. o_± □ 6 3 


9623 


1374 


1 1 tin 

J XOU 


4946 


6732 


/ B 4 L.X P^J1*_1 0 D 4 


9646 


1375 


3161 


4947 


6733 


/a4LiP^B 10 bb 


9747 


x^ /o 




4948 


6734 


/ o 4 L. 1 P2 o_l U b 6 


9773 


ij / / 


iioj 


4949 


6735 


f 84L.XP2B_10b / 


9785 


1378 


3164 


4950 


6736 


/84C1P2B 106B 


9801 


1 no 1 " " " 


3165 


4951 


6737 


784CIP2B_JL069 


9811 




3166 


4952 


6738 


78 4 CI P2B_10 7 0 


9843 


1JD1 


3167 


'4953 


6739 


784CIP2B_JL071 


9854 


13 82 


3168 


4954 


6740 


784CIP2B__1072 


9854 


1333 


3169 


4955 


6741 


784CIP2B_1073 


9864 


13 B4 


3170 


4956 


6742 


784CIP2B_1074 


9864 


1385 


3171 


4957 


6743 


784CIP2B_1075 


9871 


jlo ao 


3172 


4958 


6744 


/84CIP2B - _107b 


9879 


1387 


3173 


'"inFn 

4959 


6745 


784CIP2B_1077 


9861 


1388 


3 174 


4960 


6746 


784CIP2B__1078 


9885 


1389 


3175 


4961 


6747 


784CIP2B_1079 


i 9901 


1390 


3176^ 


4962 


6748 


784CIP2B_JL0B0 


9912 




3177 


4963 


6749 


784CIP2B 1081 


9916 


1392 




4964 


6750 


784CIP2B 1082 


9921 


ijyj 


3179 


4965 


6751 


784CTP2B__1083 


9925 




3180 


4966 


6752 


784ClP2B__l084 


9930 




noi 

J .Lb A 


4967 


6753 


784CIP2B_Jl085 


9949 


1396 


"3 1 

Jlaz 


496*8 


6754 


7B4C1P2B 1086 

r-j 


9951 


x j j* / 




4969 


6755 


/B4CIP2B_J1087 


9959 


1398 


3184 


4970 


6756 


/ o4v.XK<2l3 lUoo 


5o75 


1399 


3185 


4971 


D #3 / 


/ *-X r^i Ja^X li d If 




1400 


3186 


4972 


6758 


/o4i-xjr«J3 xxjyv 




1401 


3*187 


In? 


6759 






1402 


3188 


4974 


6760 




10041 


1403 


3189 


4975 


6761 




10067 


1404 


3190 


4976 


6762 




10073 


1405 


3191 


4977 


6763 


7R4CTP2B 1096* 


10112 


1406 


3192 


4978 


6764 


784CIP2B 1097 


10117 


1407 


3193 


4979 


6765 


784CIP2B 1098 


10132 


1408 


3194 


4980 


6766 


784CIP2B 1099 


10169 


1409 


3195 


4981 


6767 


7B4CIP2B 1100 


10217 


1410 


3196 


4982 


6768 


784CIP2B 1101 


10226 


1411 


3197 


4983 


6769 ; 


784CIP2B 1102 


10232 


1412 


3198 


49B4 


6770 


784CIP2B 1103 


10237 


1413 


3199 


4985 


6771 


784CIP2B 1104 


10279 


1414 


3200 


4986 


6772 


784CIP2C_1 


33 


1415 


3201 


4987 


6773 


704CIP2C_2 


271 


1416 


3202 


4988 


6774 


784CIP2C3 


848 


1417 


3203 


4989 


6775 


784CIP2C_4 


849 


1418 


3204 


4990 


6776 


784CIP2C__5 


864 


1419 


320S 


4991 


6777 


784CIP2C_6 


953 


1420 


3206 


4992 


677B 


784CrP2C_7 


980 


1421 


3207 


4993 


6779 


784CIP2C8 


1595 


1422 


3208 


4994 


6780 


784CIP2C_9 


1697 


1423 


- 3209 


4995 


6781 


784CIP2CJL0 


1744 
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SBQ ID NO: 


SBQ ID 


SEQ ID NO: 


SBQ ID 


Priority 


SBQ ID 


of full- 


NO: of 


of con tig 


NO: 


docket number^ 


NO:in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SBQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




1 1424 


3210 


4996 


6782 


784CIP2CJL1 


1937 


142S 


3211 


4997 


6783 


784CIP2C_12 


1955 


1426 


3212 


4998 


6784 


784CIP2C_13 


1955 


1427 


3213 


4999 


6785 


784CIP2C 14 


2185 


1428 


3214 


5000 


6786 


784CIP2C 15 


2889 


1429 


3215 


5001 


6787 


784CIP2C 16 


2901 


1430 


3216 


5002 


6788 


784CIP2CJL7 


2902 


1431 


3217 


5003 


6789 


784CIP2C 18 


2905 


1432 


3218 


5004 


6790 


784CIP2CJ19 


2946 


1433 


3219 


5005 


6791 


784CIP2C 20 


2956 


1434 


3220 


5006 


6792 


784CIP2CL21 


2959 


143S 


3221 


5007 


6793 


784CIP2CJ22 


2965 


1436 


3222 


5008 


6794 


784CIP2C23 


2966 


1437 


3223 


5009 


6795 


784CI?2C_24 


2970 


1438 


3224 


5010 


6796 


784CIP2C_25 


2985 


1439 


3225 


5011 


6797 


784CIP2C 26 


2987 


1440 


3226 


5012 


6798 


784CIP2C 27 


2993 


1441 


3227 


5013 


6799 


784CIP2C_28 


2993 


1442 


3228 


5014 


6800 


784CIP2C 29 


3017 


1443 


3229 


5015 


6801 


784CIP2C 30 


3046 


1444 


3230 


5016 


6802 


784CIP2C 31 


3050 


144S 


3231 


5017 


6803 


784CIP2C 32 


3357 


1446 


3232 


5018 


6804 


784CIP2C33 


| 3359 


1447 


3233 


5019 


6805 


784CIP2C_34 


3432 


1448 


3234 


5020 


6806 


784C1P2C_35 


3438 


1449 


3235 


5021 


6807 


784CIP2C 36 


3439 


14S0 


j 3236 


5022 


6B08 


7B4CIP2C39 


3463 


1451 


3237 


5023 


6809 


784CIP2C 40 


3466 


1452 


3238 


5024 


6810 


784CIP2C_41 


3466 


1453 


3239 


5025 


6811 


784CIP2C_42 


3467 


14*4 


3240 


5026 


6912 


784CIP2C_43 


3466 


1455 


3241 


5027 


6813 


784CIP2C_44 


3483 


1456 


3242 


5028 


6814 


784CIP2C_45 


3484 


1457 


3243 


5029 


6815 


784CIP2C_46 


3488 


1458 


3244 


5030 


6816 


784CIP2C__47 


3491 


1459 


3245 


5031 


6817 


784CIP2Cj48 


3493 


1460 


3246 


5032 


6818 


7B4CIP2C_49 


3494 


1461 


3247 


5033 


6819 


784CIP2C 50 


349S 


1462 


3248 


5034 


6820 


7B4CIP2C_51 


3496 


1463 


3249 


5035 


6621 


7B4CIP2C_52 


3503 


1464 


3250 


5036 


6822 


784CIP2C_53 


3503 


1465 


3251 


503/ 


6823 


784CIP2C 54 


3504 


1466 


3252 


5038 


6824 


784CIP2C.55 


3511 


1467 


3253 


5039 


6825 


784CIP2C 5.6 


3531 


1468 


3254 


5040 


6826 


784CIP2C_S7 


3536 


1469 


3255 


5041 


6827 


784CIP2C_58 


35^ 


1470 


32$6 


5042 


6828 


784CIP2C 59 


3548 


1471 


3257 


5043 


6829 


7B4CIP2CJ50 


3551 


1472 


3258 


5044 


6830 


784CIP2C_6l 


3553 


1473 


3259 


5045 


6831 


784CIP2C 62 


3564 


1474 


3260 


5046 


6832 


784CIP2C 63 


3567 


1475 


3261 


5047 


6833 


784CIP2C 64 


3572 


1476 


3262 


5048 


6834 


784CIP2C 65 


3573 


1477 


3263 


5049 


6835 


784CIP2C 66 


3574 


1478 


3264 


5050 


6836 


784CIP2C 67 


3583 


1479 


3265 


5051 


6837 


784CIP2C 68 


3615 


1480 


3266 


5052 


6838 


784CIP2C 69 


3623 


1481 


3267 


5053 


6839 


784CIP2CJ70 


3629 


1482 


3268 


5054 


6840 


784CIP2C 71 


3666 


1483 


3269 


5055 


6841 


784CIP2C_72 


3667 


1484 


3270 


5056 


6842 


784CIP2C_73 


3906 


1485 


3271 


5057 


6843 


784CIP2C 74 


3912 
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SEQ ID NO i 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Prxority 


SEQ ID 


of full- 


NO: of 


of con tig 


NO: 


docket number_ 


NO:in 


length 


full- 


nucleotide 


of con tig 


corre e ponding 


D. S.S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 




secjuence 


pir i o3? i t y 






sequence 






ctpp J. X t_ a t JLOn 






3272 




6844 




i a*? a 


1487 


3273 


5059 






3928 


1 A Oft 




3UQU 


6846 


' B^* ' — 1 ^tll^ / / 




*I A HQ 


3275 


bUbl 


OO** / 






1490 


3276 


5062 


6848 




3981 


1491 


3277 


5063 


6849 




3989 


14 92 


327B 


5064 


6850 


/o4CAJP2C__ol 


4295 


1493 


3279 


5065 


6851 




4300 


1494 


3280 


5066 


6852 


/84CiP2C_o3 


4360 


1495 


3281 


5067 


£853 


784CIP2C 84 


4362 


1496 


3282 


5068 


6854 


784CIP2C 85 


4371 


• 1497 


3283 


5069 


6855 


784CIP2C_86 


4373 


1498 


3284 


5070 


6856 


784CIP2CJJ7 


4376 


1499 


3285 


5071 


6857 


784CIP2C_89 


4378 


1500 


3286 


5072 


6858 


784CIP2C_90 


4382 


1501 


3287 


5073 


6859 


784CIP2C_91 


4409 


• 1502 


3288 


5074 


6860 


784CIP2C_92 


4421 


1503 


32B9 


5075 


6861 


7B4CIP2C93 


4421 


1504 


3290 


5076 


6862 


784CIP2C_94 


4426 


1505 


3291 


5077 


6863 


784CIP2C_95 


4430 


1506 


3292 


5078 


6664 


784CIP2C_96 


4435 


1507 


3293 


5079 


6865 


784CIP2C_97 


4436 


150B 


3294 


5080 


6866 


784CIP2C_98 


4439 


1509 


3295 


5081 


6867 


784CIP2C 99 


4440 


1510 


3296 


5082 


6868 


784CIP2CJL00 


4441 


1511 


3297 


50B3 


6869 


784CIP2C101 


4442 


j 1512 


3298 


5084 


6B70 


7B4CIP2CJ.02 


4455 


1513 


3299 


5085 


6971 


784CIP2C103 


4462 


1514 


3300 


5086 


6872 


784CIP2C__104 


4466 


1515 


3301 


5087 


6873 


784CIP2CJL05 


4469 


1516 


3302 


5088 


6874 


784CIP2C 106 


4477 


1517 


3303 


5089 


6875 


784CIP2C_107 


4481 


1518 


3304 


5090 


6876 


784CIP2C_108 


44B3 


1519 


3305 


5091 


6B77 


784CIP2CML09 


4484 


1520 


3306 


5092 


6878 


784CIP2C_110 


4486 


1521 


3307 


5093 


6B79 


784CIP2C_111 


4490 


1522 


3308 


5094 


6880 


784CIP2C 112 


4499 


1523 


3309 


5095 


6881 


784CIP2C 113 


4503 


1524 


3310 


5096 


6882 


784CIP2C_114 


4506 


1525 


3311 


5097 


6883 


784CIP2C 115 


4509 


1526 


3312 


5098 


6804 


78 4 CIP2C__1 1 6 


4514 


1527 


3313 


5099 


6885 


/84C1 P2C_117 


4516 


1528 


3314 


5100 


6886 


to / nTn">^ t i a 


4522 


1529 


3315 


5101 


6887 




4525 


i tin 


->-ji c 
jjIo 


5102 


OHHO 


TDilPTtnP T)ft 


4i>Z / 




3317 




£L a act 


/ B U v. J. if&Kr^k / 1 






3318 


5104 


oojU 






JL J J J 


3319 


5105 


6891 




4532 


1534 


— -5320 ■ 


iii nc 

51vo 


6892 




4537 


1535 


3321 


5107 


6893 




4538 


1536 


3322 


5108 


DO? 1 * 






1537 


3323 


5109 


6895 




4552 


1538 


3324 


5110 


6896 


784CIP2C 128 


4559 


1539 


3325 


5111 


6897 


764CIP2CJL29 


4567 


1540 


3326 


5112 


6898 


7B4CIP2C_130 


4568 


1541 


3327 


5113 


6899 


784CIP2CJL32 


45B5 


1542 


3328 


5114 


6900 


784CIP2C_133 


4592 


1543 


3329 


5115 


6901 


784CIP2C134 


4609 


1544 


3330 


5116 


6902 


784CIP2C_135 


4616 


1545 


3331 


5117 


6903 


784CIP2C136 


4617 


| 1546 


3332 


5118 


6904 


784CIP2CJ137 


4618 


1547 


3333 


5119 


6905 


784CIP2C 138 


4620 
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SBQ ID NO: 


SKQ ID 


S3Q ID NO: 


SEQ ID 


Priority 


SBQ ID | 


oE full- 


NO: of 


of con tig 


NO: 


docket mimber_ 


NO: in I 


length 


full- 


nucleotide 


of contig 


cor re spondi ng 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: m 


09/48B, 725 


sequence 


peptide 




sequence 


priority 






sequence 






etppx J. LdCJun 






-3 -3 j 1 




6906 


f D _l»Xlr— .v. <U7 


4624 


1549 


^TTr 

33 Jt» 


CI *>1 


com 


*itiAr r TB*jr" i An 
/ oii-xirxv-^xfiu 


A 


1550 


3336 


5122 


6908 


7flAf , TTl'>P 1 At 
fQ^l»X_*_;i, XfrX 


ACXA 


1551 




cioi 


6909 


f0*k\ — Lf<;^ XI — 


4dj a 


1552 


3338 


5124 


b^lU 


*7n Ar'TD^r' 1 a** 


jf*C 1 Q 


1553 


3339 


5125 


6911 


to A nTTior 1 ^ a a 
/H4LlFz(_ X<§4 


4643 


1554 


3340 


5126 


6912 


7B4CIP2C_145 


4644 


1555 


3341 


5127 


6913 


/84L.XP2L. 146 


4655 


1556 


3342 


5128 


6914 


7 o 4 CIP2C__1 47 


4668 


1557 


! 3343 


5129 


6915 


784CIP2C_JL48 


4677 


1558 


3344 


5130 


6916 


784CIP2C_149 


AC^h 

4677 


1559 


3345 


. 5131 


6917 


7B4ClP2C_150 


4677 


1560 


3346 


5132 


6918 


7B4CIP2C_152 


4682 


1561 


3347 


5133 


6919 


784CIP2C 153 


4690 


1562 


3348 


S134 


6920 


784CIP2C_154 


4691 


1563 


3349 


5135 


6921 


784CIP2CJL55 


4727 


1564 


3350 


5136 


6922 


784CIP2C_156 


4730 


1565 


3351 


5137 


6923 


784CIP2CJL57 


4734 


1566 


3352 


5138 


6924 


784CIP2C_15B 


4757 


1567 


3353 


5139 


6925 


784CIP2C_1S9 


4764 


1568 


3354 


5140 


6926 


784CIP2C160 


4786 


1569 


3355 


5141 


6927 


784CIP2CJL61 


4793 


1570 


3356 


5142 


6928 


784CIP2C_162 


4825 


1571 


3357 


S143 


6929 


784CIP2C_163 


4826 


1572 


335B 


5144 


6930 


784CIP2C_164 


4850 


1573 


3359 


5145 


6931 


784CIP2CJL65 


4853 


1574 


3360 


5146 


6932 


784CIP2C166 


| 4855 


1575 


3361 


5147 


6933 


784C1P2C 167 


4856 


1576 


3362 


5148 


6934 


784CIP2C 168 


j 4867 


1577 


3363 


5149 


6935 


784CIP2CJL69 


4869 


1578 


3364 


5150 


6936 


784CIP2C_170 


4878 


1579 


3365 


5151 


6937 


784CIP2C__171 


4880 


1580 


3366 


5152 


6938 


704CIP2C_172 


4942 


1581 


3367 


5153 


6939 


784CIP2C_173 


4945 


1582 


3368 


5154 


6340 


784CIP2CML74 


4950 


1583 


3369 


S155 


'6941 


784CIP2C 175- 


4952 


1584 


3370 


5156 


6942 


784CIP2C 176 


4954 


15B5 


3371 


5157 


6943 


784C1P2C_177 


4958 


1586 


3372 


> 5158 


6944 


784CIP2C 178 


4961 


1587 


3373 


5159 


6945 


784CIP2C__179 


5590 


— _____ 

1588 


3374 


?T?K 

5160 


6946 


7B4CXP2L.__loO 


5599 


1589 


3375 


5161 


6947 


784ClrZC__x8x 


5692 


1590 


3376 


5162 


6948 


i a st r*T -i no 
/o4— .XiriiL- loZ 


3 / Ji 


1591 


3377 


5163 


6949 


784dP«6— • 183 


5765 


1592 


33 /o 


riri 


OJ3U 




3 / IX 


1 COO 


"31 TO 


CI cc 




no act HOC inc 


5774 


1594 


jJOU 


Dlob 


6952 


/ O * v* X xr £ X Q a 


5793 


1595 


33 81 


5167 


©i03 




5806 








6954 


/O^Ll^u loo 


$852 


1597 


33 83 


5169 


6955 


^AAPTP^P 1 R9 


5892 


1598 




5170 


6956 




6057 




33 D-> 


CI *ll 

31 / J. 


t»?!3 / 


/O^Llr<tL. X jX 


6061 


1600 


3386 


5172 


6958 


7B4CIP2C 192 


6109 


1601 


3387 


5173 


6959 


784CIP2C_193 


6160 


1602 


3388 


5174 


6960 


784CIP2CJL94 


6297 


1603 


3389 


5175 


6961 


784CIP2CJ.95 


6398 


1604 


3390 


5176 


6962 


784CIP2C 196 


6398 


1605 


3391 


5177 


6963 


784CIP2C_197 


6415 


1606 


3392 


5178 


6964 


784CIP2C_198 


6448 


1607 


3393 


5179 


6965 


784CIP2C_199 


6469 


1608 


3394 


5180 


6966 


784CIP2C 200 


6476 


1609 


3395 


5181 


6967 


784CIP2C 201 


6561 
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SEQ ID NO: 
of full- 
length 
nucleotide 
sequence 


SEQ ID 
NO: Of 
full- 
length 
peptide 
sequence 


SEQ ID NO; 
of contig 
nucleotide 
sequence 


SEQ ID 
MO: 

of contig 

peptide 

sequence 


Priority j 
docket number^ 
corre spond ing 
SEQ ID NO: m 
priority 
dppl ica t ion 


SEQ ID 
NO : in 
U.S.S .H. 

09/4BB, /25 


icin 

1610 


3396 


CI QO 


© a 






1611 


3397 


5183 


6969 




DO /O 


1612 


3398 




Of /U 




6662 


1613 


->->qq 

3399 


C"t DC 


6971 




6672 


1614 


3400 


5186 


6972 




Dor>Ji 


1615 


3401 


5187 


6973 




6695 


| 1616 


3402 


5188 


6974 




6746 


1617 


3403 


5189 




/ 0 4 C 1 Jf*5v_2 1) f 


6898 


1618 


3404 


5190 


6976 


/ 84 CX J*2C_2 1 0 


6938 


1619 


3405 


5191 


6977 


7B4CIP2C 211 


6943 


1620 


3406 


5192 


6978 


784 C1P2C_2 12 


7110 


1621 


3407 


5193 


6979 


784CIP2C_213 


7200 


1622 


3408 


5194 


6980 


7B4CIP2C_214 


7212 


1623 


3409 


5195 


6981 


784CIP2C_215 


7218 


1624 


3410 


5196 


6982 


784CIP2C_2l6 


7249 


1625 


3411 


Si 9 7 


6983 


784CIP2C217 


7500 


1626 


3412 


5198 


6984 


784CIP2C218 


7509 


1627 


3413 


5199 


6985 


784CIP2C_219 


7523 


1628 


3414 


5200 


6986 


784CIP2C_220 


7544 


1629 


3415 


5201 


6987 


784C1P2C__221 


7564 


1630 


3416 


5202 


6988 


784CIP2C_222 


7568 


1631 


3417 


5203 


6989 


784CIP2CJJ23 


7631 


1632 


3418 


5204 


6990 


784CIP2C_224 


7813 


1633 


3419 


5205 


6991 


784CIP2CJ225 


7831 


[ 1634 


3420 


S206 


6992 


784CIP2C_226 


7843 


1635 


3421 


5207 


6993 


784CIP2C_227 


7907 


1636 


3422 


5208 


6994 


784CIP2C 228 


7943 


1637 


3423 


S209 


6995 


784CTP2C_229 


8175 


1638 


3424 


5210 


6996 


784CIP2C_230 


8216 


1639 


3425 


5211 


6997 


784CIP2C 231 


8225 


1640 


3426 


5212 


6998 


784CIP2C_232 


8271 


1641 


3427 


5213 


6999 


784CIP2C 233 


8397 


1642 


3428 


5214 


7000 


784CIP2C_234 


8466 


1643 


3429 


5215 


1 7001 


784CIP2C_235 


8503 


1644 


3430 


5216 


7002 


784CIP2C 236 


8953 


1645 


3431 


5217 


7003 


784CIP2C_237 


9106 


1646 


3432 


5218 


7004 


784CIP2C_238 


9139 


1647 


3433 


5219 


7005 


784CIP2C_23 9 


9555 


1648 


3434 


5220 


7006 


784CIP2C_24 0 


9650 


1649 


3435 


5221 


7007 


784CIP2C 241 


9889 


1650 


3436 


5222 


7008 


784CXP2L_242 


$933 


1651 


3437 


5223 


1 7009 


784C.IP2L._24 J 


QOCl 


1652 


3438 


5224 


7010 


TQ/ir'TDIP A A 

/ o4 V* X rZ C_24 * 


9981 


1653 


3439 


5225 


7011 




746 


1654 


3440 


5226 


7012 




3558 


1655 


3441 


5227 


*rn-i 1 

/U JL -J 


70APTD9n 1 
/ 0 ^ J. Ir^i L_J 


3558 


1656 


3442 


5228 


/UJL4 


^n4^"TDO^> a 


3633 


1657 


3443 




7(11 C 


TRAfTDOT") 
/ O *± >.» X lr *5 i-l 


3658 


1658 


3444 


coin 


/UJL© 


7flAf*TD9tl £ 


3732 


1659 


3445 


eon 




7ft4f*TP9n 7 


4004 


-Loot) 


*5 A A K 




Tfll A 
/UJLO 




4700 


1661 


3447 


5233 


7019 




4703 


1662 


3448 


5234 


7020 


784CIP2D 10 


4774 


1663 


3449 


5235 


7021 


784CIP2D_11 


4894 


16"64 


3450 


. 5236 


7022 


784CIP2D_12 


4918 


1665 


3451 


5237 


7023 


784CIP2D13 


5159 


1666 


3452 


5238 


7024 


784CIP2D 14 


■ 7441 " 


1667 


3453 


5239 


7025 


784CIP2D_JL5 


8673 


1668 


3454 


5240 


7026 


784CIP2J_16 


8679 


1669 


3455 


5241 


7027 


784CIP2DJL7 


8727 


1670 


3456 


5242 


7028 


784CIP2D 18 


8734 


1671 


3457 


5243 


7029 


784CIP2DJL9 


8756 
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SBQ ID NO: 


SKQ ID 


SBQ ID NO: 


SBQ ID 


Priority 


SBQ ID 


of full- 


NO: Of 


of contig 


NO: 


docket number_ 


N0:in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




1672 


3458 


5244 


7030 


784CIP2D20 


8818 


[ 1673 


3459 


5245 


7031 


784CIP2D_21 


8844 


1674 


3460 


5246 


7032 


784CIP2D_22 


8B46 


1675 


3461 


5247 


7033 


784CIP2D_23 


8912 


1676 


! 3462 


5248 


7034 


784CIP2D24 


8918 


1677 


3463 


5249 


7035 


784CIP2D_25 


8918 


1670 


3464 


5250 


7036 


784CIP2D_26 


8941 


1673 


3465 


5251 


7037 


784CIP2D 27 


8941 


1680 


3466 


5252 


7038 


784CIP2D28 


8951 


1681 


3467 


5253 


7039 


784CIP2D_29 


8951 


1682 


3468 


5254 


7040 


784CIP2DJ30 


9007 


1683 


3469 


5255 


7041 


784CIP2DJ31 


9012 


i 1684 


3470 


5256 


7042 


784CIP2D32 


9013 


1685 


3471 


5257 


7043 


784CIP2D_33 


9025 


1686 


3472 


5258 


7044 


784CIP2D_34 


9053 


1687 


3473 


5259 j 


7045 


7B4CIP2D_35 


9054 


1688 


3474 


S260 


7046 


7B4CIP2D 36 


9054 


1689 


3475 


5261 


7047 


784CIP2D 37 


9113 


1690 


3476 


5262 


7048 


784CIP2D_38 


9134 


1691 


3477 


5263 


7049 


784CIP2D_39 


9152 


1692 


3478 


5264 


7050 


784CIP2D_40 


9152 


1693 


3479 


5265 


7051 


784CIP2D 41 


9211 


1694 


3480 


5266 


7052 


784CIP2D_42 


9223 


1695 


3481 


5267 


7053 


784CIP2D_43 


9223 


1696 


3482. 


5268 


7054 


784CIP2D_44 


9231 


1697 


3483 


5269 


~70"55 


784CIP2D_4S 


9236 


1698 


3484 


5270 


7056 


784CIP2D_46 


9236 


1699 


3485 


5271 


7057 


784CIP2D_47 


9303 


1700 


3486 


5272 


7058 


7B4CIP2D__48 


9309 


1701 


3487 


5273 


7059 


784CIP2D_49 


9314 


1702 


3498 


5274 


7060 


784CIP2D_50 


9326 


1703 


3489 


5275 


7061 


784CIP2D_51 


9339 


1704 


3490 


5276 


7062 


784CIP2D_52 


9348 


1705 


3491 


5277 


7063 


784CIP2D_53 


9376 


xitit ■ 


3492 


5278 


7064 


784CIP2D_54 


9382 


1707 


3493 


5279 


7065 


784CIP2D_55 


9407 


1708 


3494 


5280 


7066 


784CIP2D 56 


9414 


1709 


3495 


. 5281 


7067 


7B4CIP2D_S7 


9439 


1710 


3496 


5282 


7068 


784CIP2D58 


9485 


1711 


3497 


5283 


7069 


784CIP2D 59 


9493 


1712 


3498 


5284 


7070 


784C*P2D_>0 


9501 


1713 


3499 


5285 


7071 


784CIP2D_61 


" 952* 


1714 


3500 


5286 


7072 


784CIP2D_62 


9526 


1715 


3501 


5287 


7073 


784CIP2D_63 


9551 


1716 


3502 


5288 


7074 


784CI?2D_64 


9557 


1717 


3503 


5289 


7075 


7B4CIP2D_65 


9568 


1718 


3504 


5290 


7076 


784CIP2D 66 


9588 


1719 


3505 


5291 


7077 


784CIP2D_67 


9597 


1720 


3506 


5292 


7078 


784CIP2D_68 


9615 


1721 


3507 


5293 


7079 


784CIP2D_69 


9628 


1722 


3508 


5294 


7080 


784CIP2DJ70 


9649 


1723 


3509 


5295 


7081 


784CIP2D_71 


9652 


1724 


3510 


5296 


7082 


784CIP2D_72 


9660 


1725 


3511 


5297 


7083 


784CIP2DJ73 


9662 


1726 


3512 


529B 


7084 


7B4CIP2D_74 


9725 


1727 


3513 


5299 


7085 


784CIP2D 75 


9746 


1728 


3514 


5300 


7086 


784CIP2D__76 


9777 


1729 


3515 


5301 


70B7 


7B4CIP2DJ77 


9787 


1730 


3516 


5302 


7088 


784CIP2D 78 


9790 


1731 


3517 


5303 ! 


7089 


784CIP2D 79 


9842 


1732 


3518 


5304 


7090 


784CIP2D_80 


9842 


1733 


3519 


5305 


7091 


784CIP2D_81 


9848 
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SEQ ID MO: 


SEQ ID 


SEQ ID NO: 


SEQ ID 


Priority 


SEQ ID 


of full- 


NO: of 


of contig 


HO: 


docket number^ 


NO: in 


length 


full- 


nucleotide 


of contig 


corresponding 


U.S. S.N. 


nucleotide 


length 


sequence 


peptide 


SEQ ID NO: in 


09/488,725 


sequence 


peptide 




sequence 


priority 






sequence 






application 




1734 


3520 


5306 


7092 


784CIP2D 82 


9867 


" i n-i tL 

JL /-i:> 


obt-i 

3521 


5307 


7093 


7B4CIP2D 83 


, 10010 


1736 


3522 


5308 


7094 


784CIP2D_84 


10011 


1737 


3523 


5309 


7095 


784CIP2D 85 


10052 


1738 


3524 


5310 


7096 


784CIP2D 86 


10057 


1739 


3525 


5311 


7097 


784CIP2D 87 


10085 


1740 


3526 


5312 


7098 


784CIP2D 89 


10139 


1741 


3527 


5313 


7099 


784CIP2D 90 


10142 


1742 


3528 


5314 


7100 


784CIP2D 92 


10165 


1743 


3529 


5315 


7101 


784CIP2D 93 


10173 


1744 


3530 


5316 


7102 


784CIP2D 94 


10173 


174S 


3531 


5317 


7103 


7B4CIP2D95 


102^3 


1746 


3532 


5318 


7104 


784CIP2B 1 


3121 


1747 


3533 


5319 


7105 


7B4CIP2B_2 


3628 


1748 


3534 


5320 


7106 


784CIP2E 4 


3673 


1749 


3535 


5321 


7107 


f 7B4CIP2B_5 


4018 


1750 


3536 


5322 


7108 


784CIP2E 6 


4467 


1751 


3537 


5323 


7109 


784CIP2B 7 


4865 


1752 


3538 


5324 


7110 


1" 784CIP2E_8 


4916 


1753 


3539 


5325 


7111 


784CIP2E 9 


4923 j 


1754 


3540 


5326 


7112 


784CIP2E_10 


4926 


1755 


3541 


5327 


7113 


784CIP2E 11 


4962 


1756 


3542 


5328 


7114 


784CIP2E_JL2 


4963 


1757 


3S43 


5329 


7115 


784CIP2B_13 


4964 


1758 


3544 


5330 


7116 


7B4CIP2B_14 


4988 


1759 


3545 


5331 


7117 


784CIP2B15 


5835 


1760 


3546 


5332 


7118 


784CIP2B_16 


7682 


1761 


3547 


5333 


7119 


784CIP2B_17 


7682 


1762 


3548 


5334 


7120 


784CIP2B_18 


7699 


1763 


3549 


5335 


7121 


784CIP2B 19 


7707 


1764 


3550 


5336" 


7122 


784CIP2B 20 


7707 


1765 


3551 


5337 


7123 


784CIP2E_2l 


7752 


1766 


3552 


5338 


7124 


784CIP2E_22 


8357 


1767 


3553 


5339 


7125 


784CIP2E_23 


9065 


1768 


3554 


5340 


7126 


784CIP2B 24 


9324 j 


1769 


3555 


5341 


7127 


784CIP2FJ. 


2976 


1770 


3556 


5342 


7128 


784CIP2F 2 


3559 


1771 


3557 


5343 


7129 


784CIP2P 3 


4021 


1772 


3558 


5344 


7130 


784CIP2F 4 


4474 


1773 


3559 


5345 


7131 


784CIP2P 5 


4566 


1774 


3560 


5346 


7132 


784dP2F_6 


4705 


1775 


3561 


5347 


7133 


784CIP2FJ7 


4707 


1776 


3562 


5348 


7134 


784CIP2P 8 


4712 


1777 


3563 


5349 


7135 


784CIP2F_9 


5008 


1778 


3564 


5350 


/ i.j t> 




5009 


1779 


3565 


5351 


7137 


784CIP2FJ11 


sois- 


1780 


3566 


5352 


7138 


7B4CIP2?_12 


5015 


1781 


3567 


5353 


7139 


784CIP2F_13 


7724 


1782 


3568 


5354 


7140 


784CIP2FJ14 


7725 


1783 


3569 


5355 


7141 


784CIP2P 15 


8828 


1784 


3570 


5356 


7142 


784CIP2F 16 


8830 


1785 


3571 


5357 


7143 


784CCP2FJL7 


9739 


1786 


3572 


535B 


7144 


784CIP2F_18 


9896 | 
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TABLE 7 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D**Aspartic Acid, E= 
Glutamic Acid, F=*Phenylalanine , G=Glycine, 
H-Histidine, I«Isoleucine f K«=Lysine, 
L«Lcucino ; M=>Methionlne, N=Asparagine, 
P=Proline, Q~Glutamine, R=Arginine, 
S=Serlne, T«Threonine, V^Valine, 
W=Tryptopnan, Y=Tyrosine, X= Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 


5359 


337 


1131 


AHLSARLSALX LDEVAILPAPQNLS VLSTNMKHIjIWWS PVIAPG 
ETVYYSVEyOGEYESI*YTSHIWIPSSWCSI»TEGPECDVTDDITA 
TVPYOT,RVRATLGSQTS /CLEHP/VS IPLIBTQ PSLPDL/ RME I 
TKDGFHLVIBI^LGPQFBFLVAYWRREPGAEE^ 
VHLETMEPGAAY CVKAQTFVKA I GRYS AFS QTECVEVQGEAI PL 
VLALFAFVGFMLI LVWPLFVWKMGRLU5/ YLLLPRGGSSQTPW 
KITQF 


5360 


2 


1115 


PRVRSSGGQKDPASQQWARPRPTQP3KMRRRVIARPVG3SVRLK 
C^^GHPRPDITWMKI>IX^TRPEAAEPRKKKWTI*SIiKNLRPED 
SGKYTCKVSireAGAINATYKVDVIQRl^ 

FGGTTS FQCKYRSDVKPVIQH LKRVBYGAEGRHNST1 D VGGQKF 
VVLPTGDVWSRPIX^YI«NKLLXTRARQDDAG^XC£«GANTMG^ 
FRSAFLTVLPDPKPPGPPVASSSSATSLPWPVVIGIPAGAVFII. 
GTLLLWIiCQAQ B3CPCTPAPAPPr A PGHR? PGTARDRSGDKDLPSL 
AALS AGPGVGLCBEHGS PAAPQI ILLGPGPVAG PKLYPKliYTGFlS 
TPHTYTHPPPSCQliNSSHS ) 


5361 


3 


925 


HEGS I S SAN II*IiDDQFQPKJbTDFAMAH?RSHLEHQSCTINMTSS 
SSK^WYMPEEYIRXIGKLSIKTDVYSFGIVIMKVLTGCRVVLDD 
PKHIQl^DIJ^EIWEKRGLmCI»SPI>DKKVPPCPRNFaAKIiFCL 
AGRCAATRAKI^SMDEVIOTLBSTQASLYFAEDPPTSLKSFRC 
PSPLFLKNVPS I PVEDDESQKNNLLPSDBGIjRIDRMTQKTPFEC 
SQSBVMFLSIJ)KKPESK11NEEACNMPSSSCEESWFPICYIVPSQD 
IJIPYKVNIDPSSEAPGHSCRSRPVBSSCS3KFSWDSYBQYKKE 


5362 


2 


4879 


scqvegctrtynssqsigkhmktahpdqyaafkmqrkskkgqka 

NNLKTPNNGKFVYFLPSPVNSSNPFFTSQTKANGNPACSAQLQH 

VSPPIKPAHIASVSTPLLSSMKSVINPNITSQDKNSQGGMLCSQ 

MENLPSTALPAQMEDLTKTVLPLNIDRGSDPFLSLPAESSSIDL 

FPSPADSGTNSWSQLENNTNHYSSQIBGNTNSSFLKGGNGENA 

VFPSQVNVAl^FSSTNAOXiSAPEKVKKDRGRGQTGKEOTPKHNK 

RAKKPAriRTCKPICSRCYRAFTNPRSLGGHLSKRSYCKPLDGA 

EI7^EI^3NGQPSIJJ^MIl^TNA\mi^POX3STFNPBACFKD 

PSFT^LLAENRSPAFLPl^FPRSGVTNFNTSVSQEGSEI I IQAI* 

ETAG I PS TPEGABMLSHVSTGCVSDAS QVNATVMPNPTVP P LI>H 

IVCHPNrLLTNQNRTSNS KTSS IEECSSLPVFPTNDLLOCTVEN 

GI^SSFPNSGGPSQNF^SNSSRVSVISGPQOTRSSHrJ^KKGNS 

ASKPJ^KKVAPPLIAPNASQNLVTSDLTTMGI^AKSVBIPTTI^ 

SNVIPTCBPGSLVBIfLTQKI^^ 

VLAPIiTUCnSNGDSQMMALNSCTTSW^ 

LE I IKTAMNSQ I liEVKSGSQGAGETSQNAQINYNI QLPSVNTVQ 

1^KLPDSSP\FSSFISVMPTESNIPQSB\VSHKBDQIQEIIjEGL 

QXLKLJB^I^TPASQCVLIOTSVT^ 

MIOTQPiroKVlWPFVC^QGCKYSAMT 

MILEI KKNQLKFAPFKCWPTCTKTFTRNSinJIAHCQL 

EEMVKLK I KRPYGPJCSOSENVPASRSTQVXKQIAMTBENKKESQ 

PAI£XJ*AETGj!miSNVAVIPBI^ 

KQCNTNALTNIl|TKGRKIRRHKKfiKEEKKRKKPV3QSLKFPTRY 

SPYRP YRCVHQGCFAAFTIQQNLI LHYQAVHKSDLPAFSAEVEE 

ESEAGKESEKTE'iKQTLKEFRCQVSlX^RI FQAI TGL IQHYMKX. 

HEMTPEE I KSMTASVDVGKFPCDQI £CKSS FTTYLWYVVHLEAD 

K^IGIJRASKTEFDG VYKCDCEGCDRIYATliSbn^LRHI FNKHKDK 

HKAHLIRPRRLTPGQENMSSKANQEKSKSKHRGTKHSRCGKEGI 

KMPKTKRKKKNNLENKNAKIVQIEENKPYSLKR^ 

DALSECTSRFVTQYPCMIKGCTSWTSESNI IRHYKCHKLSKAF 

TSQHRMJjIVFKKCXlNSQVKETSEQEGAKin>VKDSDT 

NSRTTATVSQKEVEKNE*DEMDELTEIjFITKLrNEDSTSVETQA 

NTSSNVSNDFG^DNLCQSERQKASNIJCRVNO 

KAE PASAAEI^SSVRKEEETAVAI QTI EBirPASFDWS 3 F KPMGFE 

VS FIiKFLEBSAVKQlO<NTDKBHPNTGNKK^ 

TSGNHVCPCKESBTFVQFANPSQljQCSDNVKIVLDKNL 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A~Alanine, C-Cysteiue, D=Aspartic Acid, B=« . 
Glutamic Acid, F~ Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine t K=Lysine, 
I»=Leucine, M^Methionine, N=Asparagine f 
P-Proline , Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=valine, 
W=Tryptophan, Y=Tyrcsine, X=Unknown r *=Stop 
Codon, /=possible nucleotide deletion, 
\=«possible nucleotide insertion) 








VLKQlfQEMKPTVSLKKLBVHSNDPDMSVMKDIS IQKAT0RGQY " 


5363 


8066 


703 


rlcctgggbgtpgasgkrgpaattslvlci psvpp pvpfptlwp 
ppswrrqppggirrdfsrrlrreanlvatclpvraslphrlnml 
rgpgpgllllavlclgtavpstgaskskrqaqqmvqpqspvavs 

QSKPGCYDNGKHYQINQQWERTYLGWALVCTCVGGSRGFNCESK 
PRAEE7CrT>KYTObmfRVGDTYBRPK^ 

CTIANrRCOTGGQSYKIGDT^«RPHETGQY^aECVCI 1 GNGKGB^fT 
CttP IABKCPl^HAAGTSYVVGRTWRKPYQGWMMVIXrrCLGEGSGR 
ITCTSRNROnXJDTRTSYRZGIOT 

EflKO^HTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVT 
DSGVVY£>W3MQLA*KTQGNKQMlACT^^ 

G^SNGEPCVLPFTYNGRTIfYSCTTKGRQIXmWCSTTSNYEQDQ 
KYS FCTTOHTVLVQTRGGNSNGALCHFPF^ 
DNMKWOGTTQNYDAIXJKFGFCPMAAHEBICTTNEGVMYRIGDQW 
DKQHDMGHMMRCrCVGNGRGEWTCIAYSQLRr>QCrVDD ITYNVN 
OTFHKRHBEGHMLWCTCFGQGRGRWKC^PVDQOQDSETGTFYQI 
GDSWEKYVHGVRYQCYCYGRGIGRWHCQPLQTYPSSSGPVEVFI 
TBTPSQPNSHPIQWWAPQPSHISKYILRWRPKNSVGRWKEATIP 
GHLN5YT1KGLKPG WYEGQL IS IQQYGHQBVTRFDFTTTSTST 
PVTSNT\VTGKTTPFS PLVATSESVTEITAS SFWSWVSASDTV 
SQFRVEYELSEEGDBPQYLVLPSTATSV\NI P\DLI,PGRKYIVN 
VYQI S EIX3E0S LILSTSQTTAPDAPPDPTVDQVDDTS I VVRWS R 
PQAPITGYRIVYSPSVEGSSTBIiNLPE*EANSVTI>SDIjQPGVQYN 
ITIYAV^EHQESTPWIQQETTGTPRSDTVPSPRDTjQFVEVTDV 

kvtimwtppesavtgyrvdvip^^pgehgqrlplsrntf\aen 

TGI^PGVTYYFKVFAVSHGRBSKPLTAQQTTKI>\DAPTIII^>FVN 
ETDSTVIiVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVSKY 
PLRNTOPAS EYTVSLVAI KGNQESPKATGVFTTLQPGS S I PPYN 
TEVTETTIVITWTPAPRIGPKLGVRPSQGGEAPREVTSDSGSIV 
VSGIXPGVEYVYTIQVLRIX^EPJDAP\rvNK\VVTPLSPPTNLH 
LEANPDTGVLTVSWERSTTPDITGYRITTTPTNGQQGNSIiEEW 
HADQSSCTF\DNLEVPGLEYNVSVYTVKDDKESVPI SDTI IPAV 
PPPTDIiRFTN/lLGPDTMRVTWVAPPPSIDLTNFLVRYSPVKNB 
GHMLQSLS1FFLSDN\AWLTNLLPGTEYWSVSSVYEQHESTP 
\ LRGRQKTGLDS P \TGIDFS\DI TA\N5FT\VHW\ IAPRA/TP I 
TGYRrR\HHPEHF\SGRPREDR\VPHSHNS ITLTNLTPGTEYW 
S IVAI^RBESPI^ICK3QSTVSDVPRDLBVVAATPTSLI,I \SWD 
APAVTVRYYRI TYGETGGNSFVQB PTVPGSKS TAT I SGLKPGVD 
YTITVYAVTGRGDS PAS 3 KP IS INYRTE IDKPSQMQVTI) VQEJN3 
ISVKM^PSSSPVTX^VTTT\PKHGPG\^ 

EGLQ PTVEYWS VYAQNPSGESQ PIiVQTAVTNIDRPiroiiAFTD V 
DVDSIKIAWF^PQGQVSRYRVTYSSPEDGIHBLFPAPDGEEDTA 
ELQGI^PGSEYTVSVVAIJn^DMESQPLIGTQSTAIPAPTDLKFT 
QVTPTSLS AQWTPPNVQLTGYRVRVT PKE KTG PH KE INLAPDS S 
SVWSGLMVATKYBVSVYALKDTLTC PPRR 
ARVTT)ATETTrTlSWRTKTETITGFQVDAVPANGQTPlQRTXKP 
DVRS YTITGLQPGTDYKTYLYTLNDNARSSPVVI DASTAIDAPS 
NLRFLAl-TPNSLIiVSWQPPRARITGYIIKYEKPGSPPRKVVPRP 
RPGVTEATITGLEPGTEYTIYV I ALKNNQKS E PLIGRKKTDELP 
QLVTLPHPNLHGPE ILDVPSTVQKTPFVTHPG YDTGNG I QLPGT 
SGQQPSVGQQMI FEEHGFRRTTPPTIATPIRHRPRPYPPNVGQB 
ALSQTTIS HAPFQDTS E YI ISCHP VGTDBEPItQFRVPGTSTS AT 
LTQLTRGATYNI IVEAIiKDQQRHKVREEVVTVGNSVNEGLWQPT 
DDS C FD PYTVSH YAVGDEWERMS E SG FKLLCQCLGFGSGHFRCD 
SSRWCHDNGVNYKIGEKWDRQGBWKJM^ FKCDP 
HEATCYDDGKTYHVGEQWQKEYLGAICSCTC 
RPGGK PS P EGTTGQS YNQYSQRYHQRTNTNVNCP IECFM PLDVQ 
ADREDSRE 


53 64 


8066 


703 


RLCCTGGGEGTPGASGKRGPAATTSLVIjCIPSVPPFVPFPTLWP 
PPSWRRQPPGGIRRIDFSRRLRREANLVATCLPVRASLPHRLNML 

rgpgpgllllavlclgtavpstgaskskrqaqqmvqpos PVAVS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A= Alanine, Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=«Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K«=Lysine, 
L ^Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=»Threonine, V-Valine, 
W-Tryp t ophan , Y=Tyrosine> X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QSKPC^DlJGKHYQINQQWEHTYLGNALVCTCyGGSHGFNCESK " 

PEAEETCFDKVTCNTYRVGIOTBRPKDSM1WDCTCI 

CTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCliGNGKGEKT 

CKPIABKCPDHAAGTSTVVaETWEKPYQGWMMVDCTCLGEGSGR 

ITCTSRNRCNDQDffRTSYRIGiyraSI^ 

EWKCERHTS VOTTS SGSGPFTDVRAAVYQPQPHPQPPP YGHCVT 
DSGWYSVGMQLA* KTQGNKQML\CTCLG«GVSCQETAVTQTYG 
GNSNGEPCVLP FTYNGRTFYS CTTEGRQDGHLWCSTTSNYEQDQ 
KYSFCTOHTVTiVCrTRGGNSNGALCHFPF^ 

DNMKHCKT1X2NYI>ADQKPCPCPMAAH3EI CTTNBGVMYRIGDQW 
DKQHDMG1IMWRCTCVGNGRGEWTCI AYSQLRDQCI VDD1 TYNVN 
irrFHKRHBEOHMLNCTCFGQGRGRWKCDPVDQCQDSBTGTFYQI 
GDSWEKYVHGVRYQCYCYGRGIGEWHC^PLQTYPSSSGPVEVFI 
TETPSQPNSHPIQWNAPQP3HIS KY I IjRWRPKNS VGRW KEATI P 
GHLNSYTI KGLKPGVVYBGCLIS IQQYGHQEVTRFDFTTTSTST 
PVTSNT \VTGETTPFSPLVATSHSVTEITASS FWSWVSASDTV 
SG FRVEYELSEEGDEPQYIiVLPSTATSV\NIP\DI,LPGRfCYIVN 
VYQISBTCBQSIiILSTS(yirrAPl^PDPTVDQVDDTSlVVRWSR 
POT^ITGYRIVYSPSVEGSSTBIiNLPETANSVTLSDLQPGVQYN 
ITIYAVEENQESTPWIMETTGTPRSDTVPSPRDLQFVBVTDV 
PCVTI MWTP PESAVTGYRVDVI PVNLPGEHGQRLPLS RNT F \ AEN 
TGLS PGVTYY FKVEAVSHGRKSKPLTAQQTTXL \ nAPTPTLQFVN 
ETDS TVLVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVS KY 
PLRl^PASEYT^LVAIKGNQESPKATCVFTTLQPGSSIPPYN 








VSGLTPGVEYVTri<WRIXK2BRnAP\^ 

LEANPDTGVLTVS^RSTTPDITGYRITTTPTNGQQC^SLEEVV 

haix)ssctf\di^ea^leynvsvytvkddkesvpisdtiipav 

PPPTDIiRFTN/IIiGPDTMRVTWVVPPPSIDLTNFLVRYSPVKNE 
GXMLQShS I FFLS DN \AWh TNLLPGTEYWSVS S VYEQHES TP 
\LR(»QKTGIJ)SP\TG1DPS\D3:TA\MSFT\VHW\ iapra/tpi 
TGYR I R\HHP EHF \ SGRPREDR\ VPHS RNS ITLTNI/TPGTEYW 
SrUALNGREESPLLIGQQSTVSDVPRDI»BVVAATPTSLI»l\SWD 
APAVTVRYYR 1 TYGETGGNS P VQEFTVPGS KSTATISGLKPGVD 
YTI TVYAVTGRODS PASSKPISINYRTE IDKPS QMQVTDVQDNS 
ISVKWIiPSSS P VTGYRVTTT\ PKNG PG\ PTKTKTAGPDQTEMTI 
EGIXJPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGnAPTDV 
DVDS IKIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTA 
ELQGLR PGSB YTVS WALHDDMESQPL IGTQSTAI PAPTDjCKFT 
QVTPTSLSAQVrTPPNVQLTGYRVRVTPKEKTGPMKE INliAPDSS 
SVWSGLMVATICYEVSVYAIiKDTL^ 

ARVTDATETT IT IS WRTKTETITGFQVDAVPANGQTP IQRTI KP 
DVRSYTITGLQPGTDYKIYLYTIJCNARSSPVVIDASTAIDAPS 
NLRFLATTPNSLLV5WQPPRARITGYIIKYEKPGSPPREVVPRP 
RPGVTEATITGI^PG'IISYTIWIALKNNQKSEPLIGRKKTDELP 
QLVT bPHPNLHGPE I LDVPSTVQKTP FVTHPGYDTGNGIQLPGT 
SGQQPSVGQCWIFEEHGPRRTTPPTTATPIRHRPRPYPPNVGQB 
ALSQTT I SWAP FQDTSEY I ISCHPVGTDBEPLQFRVPGTSTSAT 
LTGLTRGATYN 1 1 VEAIiKDQQRH KVRE EVVTVGNS VNEGLNQPT 
DDSCPBPYl^SHYAVUDEWKRMSESGFKIJjCX^CLGFGSGHFRra 
SSRWCHDNGVNYKIGEKMDR(^E1KK2MMSCTCLGNGKGEFKCDP 
HEATC YDDGKT Y H VGKQWQKEYT/JA I CSCTC PGGQRGWRCDNCR 
RPGGEPS PEGTTGQS YNQYSQRYlIQRTNTNVtJCP I ECFMPLDVQ 
ADREDSRE 


5365 




703 


RLOn^GECTPGASGKRGPAATTSLVIiCIPSVPPPVPF 

PPSWRRQPPGGIPJ^DFSRRiRREANLVATCXPVRi^LPHRLNm. 

RGPGPGLIjI^VLCI^TAVPSTGASKSKRO^QQMVQPQSPVAVS 

QS KPGCYDNGKHYQ I NQQWBRTYLGNALVCTCYGGS RGFNCES K 

PEAEBTCFDKYTGNTYRVGiyiTERPKDSMIWDCrrCIGAGRGRIS 

CTIANRCHEGGQSYKIGOTWRRPHET^ 

CKPIAEKCFDHAAGTSYWGETWEKPYO^WMMVIXrrCLGBGSGR 
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5EQ 
ID 
NO: 



Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E& 
Glutamic Acid, F= Phenylalanine, G^Glycine, 
H=Histidine, I^Iaoleucine, K-hysine, 
L=»Leucine, M=Methianine, N*Asparagine , 
P= Proline, Q=Glutaaiine, R=Arginine, 
S=Serine , Threonine , Valine, 
W=Tryp t ophan , Y -Tyros one, X ~ Unknown , *=3top 
Codon, /^possible nucleotide deletion, 
^possible nucleotide insertion) 



ITCttmROXUQmRTSYRIGVTViSKKDmWLUlCICTGmilG 
EVnCCERHTSVQTTSSGSGPFTPVRAAVyQPQPKPQPPPYGHCVT 
DSGWYS VGMQLA* KTQGNKG#L\CTCLGNGVS CQETAVTQTYG 
GNSNGEPCVLPPTYNGRTFYSCTTEGRQIXMLWCSTT^ 
KYS ra'DHTVLVQTRGGNSNGALCHFPFLYNNHim'DCT 
I>^KWCX5TTQNYDAIX5KFGFCPMA7iHBB 1 CTTNBGVMYR IGDQVI 
DKQHD>fGHM^CTOrcNGRGBWTCIAYSQLRDQCIVDDI TYNVN 
DTFHiOiHEBGHMIiKCTCFGQGRGRW KCD PVDQCQDSETGTFYQ 1 
GDSWEKY\mGVRYQCYCYGRGlGEWHCQPLQTYPSSSGPVBVFI 
TBTPSQPNSHPIQWNAPQPSHISKYIUlWia>KBlSVGRWKEATIP 
GHLNS YTI KGLXPGWYEGQL IS I QQYGHQEVTRFDFTTTS TS T 
PVTS>Tr\VTGETTPFSPI.VATSBSVTBrTASSFWSWVSASDTV 
SGFRVEYELSEBGDBPQYIiVLrPS r TATSV\NI P\DIXPGRKYI \W 
VYQ I SBDGBQSIjI LSTSQTTAPDAPPDPTVI)QVDDTS I VVRWSR 
PQAP rTGYRIVYSPSVEGSSTELNIiPKTANSVTLSDLQPGVQYN 
ITIYAVEfiNQESrPWIQQETTGTPRSDTVPSPRDLQFVEVTDV 
KVTI MWTPPESAVTGYRVDVI PVNLPGEHGQRLPLSRNTF\ABN 
TGI»S PGV*rYYFKVFAVSHGRBSKPLTAQQTTKI*\DAPTNLQFW 
ETDSTVLVRWTPPRAQIlt3YRLTVGLTPJ^QPRQYNVGPSVSKY 
PLRlSfoQPASBYWSLVAIK^ 

TEVTETTIVITOTPAPRIGFKXGVRPSQGGBAPRBVTSDSGSIV 

VSGLTPGVF^VYTIQVLRIX^ERimP\IVNK\VVTPLSPPTN^ 

LKANPI>IX?VliTVSWERSTTPDITGYRITTTPTNGQQGNSI*F^ 

HADQS SCTP\ DNLRVPGIiBIOJVS VYTVKXfflKBS VPISDTIIPAV 

PPPTDLRFTN/ ILGPDTMRVTW\APPP SIDLTNFLVRYS PVKNE 

GRMIiQS LS I FE*LS DNNAWLTNLLPGTa YWSVSSVYEQHESTP 

\LRGRQKTGLDSP\TGIDFS\DITA\MSFT\VHSI\IAPRA/TPI 

TGYRIR\HHPEHF\SGRPREDR\VPHSRN3ITLTNIiTPGTEYW 

SIVALNGRJBESPLLIGC^STVSDVPRDLEVVAATPTSIiDI \SWD 

APAVTVRYYR I T YGETGGNS P VQEFTVPGS KSTAT I SGLKPG VD 

YTITVYAVTGRGDSPASSKPISINYRTKIDKPSQMQVTDVQDNS 

ISVKWLPSSSPVTGYRVTTT\ PKNGPG\PTKTKTAGPDQTEMTI 

EGLQPT\reYWSVYAQNPSGRSQPLVOTAVTNIDRPKGLAFTDV 

DVDSIKIAV^PXJG0VSUYRVTYSSPE3GIHELFPAPIX3EEDTA 

EJX)GI^PGSEYTVSVVAIjn)Dl^QPLlGaiQSTAIPAPTDLKFT 

QVTPTSL3AQWTPPNVQLTGYllVRVTPKEKTGPMra 

SVWSGIiMVATKYEVSVYAIiKDTLTSRPAQGVVTTLHW 

ARVTDATBTTI TI S WRTKTET ITGFQVDAVPANGQTP IQRTI KP 

PVRSYT ITGliQPGTDYKI YLYTLNDNARSSPVVI DASTATDAPS 

NIjR flattpns llvswqp prari tgyi IKYEKPGSPPREWPRP 

RPGVTEATITGLEPGTBYTIYVIALXNNQKSEPLIGRKKTDELP 
QLVTL PH PNJUHG PE I LD VP S 7VQKTPFVTHPGYDTGNG I Qh PGT 

sgqopsvgqqmifbeikjfrrttppttatpirhrprpyppnvgqe 
alsqtt is wapfqdtse yi i s chpvgtdeepiiqfrvpgtstsat 
ltgltrgatyni ivem.kdo^rhkvrbevvtvgnsvnegiinqpt 
ddscfppyt^kyavgdewrrwsksgfkllcxkxgfgsghfrc^ 
ssrwcto^vnykigekwiirq<;engg#ms 

HEATCYDDGKTYHVGEO^QKEYI/JAICSCTCFGGQRGWR 
RPGGEPS PEGTTGQS YNQYSQRYHQRTNTNVNCP IECFMPUDVQ 
ADREDSRB 



9066 



703 



RLCCTGGGKGTPGAS GKRGP AATTS LiVLC I PSVPPPVPFPXLWP 
PPSWRRQPPGGlRRDFSRi2LRREAHLVATCLPVRA3IiPHRtiNMI» 
RGPGPGLLUiAVtOXJTAVPSTGASKSKRQAQCWQPQSPVAVS 
QSKPGCYDNGKHYQINQQWRRTYLGNALVCTCYGG3RGFNCESK 
PEAEE^CFDKYTGNTYRVGDTY^ 

CTIANRCHEGGQSYKIGDTWRRPHETGfiYMLBCVCI^GRGEWT 
CKP I AEKCFDHAAGTS YVVGEl-WKKPYQGWMMVDCTCLGEGSGR 
ITCTSRNRCNDG^TRTSYRIGPTWSKKPNRGHLLQO CTGNGRG 
EWKCERHTS VQTTS SGSGPFTDVRAAVYQPQ PHPQPP P YGHCVT 
DSGWYSVGMQLA* KT0X3NKQ^\CTCLGHG^CQF/TAVTQTifG 
GNSNGE PCVLPFTYNGRTFYS CTTEGRQTX^KLVlCSTTSNYEQDQ 
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SBQ 
ID 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(/^Alanine, 0=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F* Phenyl alanine, G=:Glycine, 
H=Histidine, I^Isoleucine, K>Lysine, 
L*Leucine, M-^Methionine, N=Asparagine , 
P^Proline, QstGlutaraine, R^Arginine, 
S«.Serine, T»Threonine, V=Valine, 
W=*Tryptophan, Y= Tyro sine, X^Uhknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








KYS PCTDHTVLVQTKGGKS NGALCHFPPL YlWHNYTD CTSEGRK 
DNMKWCX5TTQNYDADQKPGFCPMAAHBB I CTTNEGVMYR IGDQW 
DKQHDMGI1MMR CTCVGNGRGB WTC IAYSQLRDQCI VDI) ITYNVN 
FTFHKimKBGHMLNCTCFGQGRGRWKCDPVrQCQDSETGTFYQI 
GDSWEKYVKGVRYQCYCYGRGIGEHHCQPLaTYPSSSGPVBVFI 
TETPSQPNSHP IQWNAPQPSHISKY ItiRWRPKNSVGRWKEATI P 
GHLNSYTIKGLKPGWYEGQLIS IQQYGHQEVTRFDPTTTSTS T 
PVTSNT \VTGETTPFS PLVATSESVTEITASSFWSWVSASDTV 
SGFRVEYEI>SEEGDEPQYLVLPSTATSV\NIP\DLIiPGRKYIVN 
VYQ I SEDGBQSLI I^TSQTTAPDAPPDPTVPQVDIJTS IWRWSR 
PQAPITGYRrVY5PSVEGSSTELNLP3TANSVTl^LQPGVQYN 
ITIYAVEEMQESTPWIQQETTGTPRSDTVPSPRDIjQFVEVTDV 

kvtihwtppe^vtgyrvdvipvni*p<?ehgorlplsrntfV^ 
tgi^pgvtyyfkvfavshgre5kpltaqqttkl\daptnlqfvn 
etdstvlvrwtppraqi tgyrltvgltrrgqprqynvgps vsky 

PURNI^QPASBYTVSLVAI KGtflQESPKATGVFTTI*QPGSS IPPYN 

TEVTBTTIVlTWrPAPRIGFKLGVRPSQGGEAPREVTSDSGSIV 

VSGLTPGVEYVYTIQVIjaXK2EimAP\IVNK\VVTPLSPPTNriH 

LEANPDTGVLTVBWEP^TTPDITGYRITTTPTNGQCX^SriEBW 

lIAIXiSSCrrP\DNLBVPGI^YNVSVYTVKDDKESVPlSDTI 1 PAV 

PPPTPLRFTN/ILGPirrMRVTW\APPP5IDLTNFLVRYSPVKNE 

GRMLQSIiS I FFLSDN\AWIiTNLLPGTEYVVSVSS VYEQHE STP 

\tjRGRQiat3IiDSP\TGIDPS\DITA\WSPT\VHM\IAPPA/TPI 

TGYR I R \HH PEHF \ SGRPRKDR\ VPHS RNS ITLTNLT PGTEYW 

SIVAl^GREESPIilGQQSTVSDVPRDIJEVVAATPXSIAI \SWD 

APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATrSGLKPGVD 

YTITVYAVTGRGDS PA68KP I S IN YRTE I DKPSQMQVTDVQDNS 

ISVKWLPSSSPVTGYRVTTT\PKNGPG\P*rKTKTAGPDOTEMTI 

PGLQPTVEYVVSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDV 

PVDSI KIAWES PQGQVSRYRVTYS S PEDGIHELFPAPIX3EEDTA 

ELQGLRPGSEYTVSVVALHDDMESQPLIGTQSTAI PAPTDLKFT 

QVTPTSLSAQWTPPNVQLTGYRVRVTPKEXTGPM 

SVWSGLI^TKYBVSVYALKDJ1,T^ 

ARVTDATETTI T I EWRTKTETITGFQVDAVPANGQTPIQRT I KP 

DVRSYTITGLQPGTDYElYLYTLNDNARSSPVVIDAS^rAIDAPS 

NLRFIjATTPNS LLVS WQPPRAR ITGYI I KYKKPGSPPREWPRP 

RPG\n:EATITGLEPGTKYTIYVlAL}aWQRSEPl*IGRlum)ELP 

QIiVTl»PHPNr^PEII^VPSlVUKJPFVTHPGYI>TGNGIQI.PGT 

SGQQPSVGQC^IFEEHGFRRTTPPTTATPIRHRPRPYPPNVGQE 

Ar^QTTISWAPFQDTSEYI IS CHPVGTPBEPLQFR VPGT6 TSAT 

LTGLTRGATYNI IVEAIOTQQRHKVREEVVIT^SYNEGLWQPT 

DDSCFDPYTV5HYAVGDE«ERMSESGFKIxLCQCLGFGSGHFRCD 

5SRWCHDNGVNYKIGEKWPR<K5ENGQMM3CTCL(SJGKGEFKCDP 

HEATCYDDGKTYHVGEQTOKETLGAICS^ 

RPGGE PS PEGTTGQS YNQY SQRYHQRTirrNVNCP IE CFMPlrDVQ 

ADREDSRE 


5367 


235 


3591 


KKILNMLCKKNI VIEYLADI LYKYLYGFCPSGIKKY1»I IHVLRL 

EVMMRKQDS/RIVGNGSBCWt^KELADVIiMDPPMDDQPGElQ^V 
KRSQI^EGlXSPLSNQLSASSTINPVPIiVGLOKPEMSLPVKPGO 
GDSEASSPFTPVADEDSVVFSKLTYIXSCASVSAPRSEVEAIiR^ 
o tt,p cnrni <?t JjvriiSVPNV^Ff;! Vr'LLDPQTNTE IANYPI YKI 
LFCVRGHDGTPE^DCFAFTESHYW^LFRIHVFRCEIQEAVSRI 
LYSPJ\TAFRRSAKQTPLSATAAPQTpDSDI FTFSVSLEI KEDDG 
KGYFSAVPKDKDRQCPKLRQGIDKXIV J YVQQTTNKELAT BRCF 
GLLLSPX3JO)VlWSDMHLLDLE^ 

FQVVNEET PKDKVLFMTTAVDLVI TE VQEP VR PLLETKVRVCS P 
NERLFWPFSKRSTTENFFLKLXQI KQRBRXNNTDTLYEWCLBS 
ESER KRRKTTASPS VRIiPQSGSQSS VI P S P PRDDE E EDNDBPLL 

sgsgdvskecaekii^tngpxi^skhhij^nvrpkqls slvrngv 
pealrgevwqljju3chnndhlvbkyrii*itkespq 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


rXctix tLcu til lu 

nucleotide 
location 
correspondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A** Alanine, O Cysteine, D:*Aspartic Acid, B=» 
Glutamic Acid, P«Phenylalanine # G-Glycine, 
H=Histidine, I^Isoleucine, K«I*ysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginiue, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=»Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








RTFPAHDY FKDTBGDGQDSLYKI CKAYS VYDBBIGY CQGQSFLA 

AVLLLHMPEEQ AFSVLVKI M FTDYGLRFJiFTCQNFEDI jH CKFYQLE 

RLMQK YI PDLYNHFLD I SCiEAHMYASQWFLTLFTAKFPLYMVFH 

IIDLXiLCBGISVTFNVALGLLKTSKDDLLJtTDFEGALKPFRVQL 

PKRYRSEBNAKKIAIEIACttMKISQKKJ^^ 

BDP I EKFERKPSRI^EANMRLF^ENDDLAHBLVTS KIALRKDLD 

NAREKADALNK3LLhtfTKQXLIDAEEEK^ 

LDKAESEIKKNSSIIGDYKQrCSQLSERLEKQQXANKVElEKlR 
QKVDDCBRCRBF^NKEQRVKGISSTKEVIJJEDTDBBK^ 

remkleiaqtki j \qlveasckiqd\i j ehpp*glpfne\vqaa\k 
ktwfnrtlssxktatgvqgketc 


5368 


573 


2014 


GAAAGAADPRRGSU3GRTMLDFAI FAVTFLLALVGAVLYLYPAS 
RQAAGIPGITPTEEKIXJNIiPDlVNSGSLHEFLVNIiHERYGPVVS 
FWFGRRLWS LGTVDVLKQHINPNKTL>D /li F*NHAEVI IKVSIW 
WWQCE*KP\QRKJGl»YK*KyVTD3LICS^ 

petqhXvplsqhmijgfamksvtcmvmgstfeddqevirfqknhg 

TVWSEIGKGFLDGSrj^KMMTRKKQYKDALMQLESVLRNI IKERK 
GRNFSQHIPIDSLVQGNIiNDQQII*EDSMI PSLASCI ITAKLCTW 
A I WFLTTSBKVQKKLYEE I NQVFGNGPVTPEK I BQLRYCQHVLC 
ETVRTAKLTPVSAQLQDI EGKlDRPI IPRETLVLYALGWLQDP 
NTWPS PHKFDPDRFDDEIiVMKTPS S IjGPSGTQECPBIjRPAYMVT 
TV^LS VLVKRLHLLSVEGQVI ETKYELVTS SREEAWI TVS KRY 


5369 


1 


6622 


PRSI^FSLWAEAAVljAIXK3LRRFvRRI^GT^ 

CHCNLFCIiADLTGrKWKBCYVWQGPTSAPlLPPVTEEDPILSSPS 

RCLKADVIK3/ VWRRDQRPERR E \ L * I FWGGEDP \ VIiLTLFTMTY 

QKKKMECGRW^FPf^VLCFSKAVIINI^RCI^RNFVRIGKWF 

VKPYEKDEKPINKSBHLSCSPTPFIiHGDSNVCTSVEINQHQPVY 

LLSP^HITLAQQSNSPFQVILCPFGLNGTLTGQAFKMSDSATKK 

LIGEWKXJFYPISCCLKBMSBEKQEDMDWEDDSI.AAVEVLVAGVR 

MIYPACPVLVPQSDIPTPSPVGSTHCSSSCLGVHQVPASTRDPA 

MSSVTLTPPTSPEEVQTVDPQSVQKWVKFSSVSDGFNSDSTSHH 

GGKI PRKLANK^H/PRVWQECNMNRAXJNKRIOfSAS 

AKVASWDPVEATQRTNCS CLRHKNLKSRNAGQQGQAPSLGQQQQ 

1I*PKHKTNEKQBKSEKPQKRPLTPBHHRVSVSDDVGWD\ADS\A 

SQRLV\ ISAP\DS0\VRPSWIR\TKDVAK\TPOMHGTEMANSPQ 

PPPIiSP\HPCDVVDECVTKTPSTPQSQHFYQHPTPDPLVPSKPM 

EDR 1DSLSQSPPPQ YQKAVE PTVYVGTAVNMBDBANIAWKYYK 

FPKKKDVEFLPPQLPSDKFKDDPVGPFGQESVTSVTBIjMVQCKK 

PLKVSDELVQQYQr Kl^CLSAXASDAEOEPKIDPYAFVEGDEBF 

IiFPDKKDRQNS EREAGKKHK V KUGTS SVTVLSHEEDAMSLFS PS 

IKCmPRPTSEARPPSTSLIYbSDIAVSYTDLDNLFN5DEDEI,T 

PGSKRSAl^DDKASCKESKTGNIJ3PI>SCISTADl^KMYPTPPS 

LEQEIMGFSPMNMNNKEYGSMDITPGGTV^ 

VDBGPCSPKPSEI KDPS YVYKPRNCQ I LVGCSMFAPLKTLPSQ Y 

LPLIKIjPEBCIYRQSWTVGKLELIiSSGPSmPPIKEGDGSNMDQE 

YGTAYTPQTHTSCGMPPSSAPPSNSGAGILPSPSTPRFPTPRTP 

RTPRTTRGAGGPASAQGSVKYENSDLYS pastpstcrplnsve p 

ATVPS 1 PEAHSLYVNLILSESVMNLFKDCNSDSCCICVCNMNIK 

GADVGV Y I PDPTQEAQ YRCTCG FSAVMNRKFGNNSGLPFEDEIjD 

I IGRNTDCGKEABKRPEALRATSAPjnTNGGXiKESHia^DDLI^ 

LQDQCTNLFSPFGAAIX^PFPKSGVISNWVR^ 

LEHGRQ FMDNMS GGKVDEALVKS S CLH PWSKRND VSMQCSQD I L 

RMLLSLQPVI^nAIQKKRTVRPWGVG^PLTWQQFnKMAGRGSYG 

TDESPEPLPiPTFIJX?YDYDYI,VI^PFALPY^PvLMIiEPYGSOjl 

D IAYVVLCPBNEAUTiNGAKS PFRDLTAX YESCHJbGQHRPVSRLL 

TDGIMRVGSTASKKLSEKLVAEWFSQAAIX^ 

CRYDIX;PYIASLPLDSSrXSQPNLVAPTSC«I>ITPPQhmn , GNA 

in , PSATlASAASSTMTVTS<3VAISTS\^ 

SNlJflSGVSSm^PSFPPFGSMNSNAAGSMSTQANTVQSGQLGGg 

<^M^AGISGBSSSI>PTQPKPDVSBSTMDIU)KVGIPTDGJ>SH 

AVTCPPAIVVyTIDPFTYBSTDBSINSSSVWOT/^^ 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(JUAlanine, C«Cysteine, D^Aspartic Acid, B» 
Glutamic Acid, P-Phenylalanine, G^Glycine, 
H=Histidine, r«Igoleucine, K=Lysine, 
L=Leucine, M=Methionine, N»Asparagine, 
P= Proline, Q*=Glut amine, R^Arginine, 
S^Serine, ^Threonine, V=*Valine, 
W=Tryptophan, Y^Tyrosine , X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
V=possible nucleotide insertion) 








TLPPHIKS'rVSVQIIPCQYU^PVKHiiDREIYPQHLKSIiAPSAP 
TQCRRPLPTSTNVKTLTGFGPGLAMETALRS PDRPBCIRLYAPP 
FIIAPVKDKO/TBl^TFGEAGQKYNVIiFVGYCI^ 
TDLYGBliIiBTCriNIDVTNRARRKJCSS 

QMSSLPWRVVIGRrx?RIGHGELKDWSCLLSRRNLQSLSKRLKDM 
CRMCGISAADSPSII^ACLVAMEPQGSPVIMPDSVSTGSVFGRS 
TTIJtfMO/TSQIOTPODTS CTHIXW 

APNPNNDGAIXSMGIFDLIiDTGDDLDPDT INILPASPTGSPVIISP 
GSHYPHGGDAGKGO^TDRLLSTEPHE^PNIIiQQPLALGYFVST 
AKAG PLPDW FWS ACPQAQYQC PLFLKAS LHLHVPSVQSDEIiLHS 
KHSHPLI^NQTSDVLRFVLEQYKAIiSWIjTCDPATQDRRSQ^PIH 
FWLNQLYNFIMNMIi 


S370 


1226 


716 


RWSRKLEIiRHAAQATfiSRPPQSQKMHPPTGKEVHALKRLRDSAN 

DHGADPNQRIX3IjGNTPLHliAACrNHVPV I TTLLRGGAR VDAtJDR 

AGRTPLHLAKSKLNXLQEGHAQCLKAVR /HGGEADHPYAEGVSG 

APRAT*AARCSGVFPSPSRWLGSAPWSRSSCTIWSLPIiHEAKCR 

AVRPLSSAAQGSAPSSSSCCTVSTSIJUAre 

GCISWL 


5371 


1331 


167 


IAAMLWKLLLRSQSCRLCS FRKMRS P PKYRP FIACFTYTTDKQS 

SKENTRTVKKLYKCSVDIRKIPJI\*KDGYF*RMKPMLKKLR1/P 

I^ELGADETAVAS ILRRCPEAl VCS PTAVNTQRKLWQLVCKNES 

FXIKI.IBQFPBSFFTIKD05NQKLNVQFFQELGLKNVVISRLLT 

AAPNVFHWVEKNKQMVRIIjQESYIjDVGGS 

NPFIIjLNSPTAIKETIXEFLQEGXSFT^ 

PRSIQNSISFSKNAFKCTOHDLKQIiVLKCPAIiLYYSVPVI^BBSM 
OGLLRKGlSIAQIROTPMVLFXTPQIVQYRIRKX»NSSGYRIKDG 
HLANLNGS KKE P BANFGKI QAKKVRPLFNPVAP LNVEE 


5372 


51 


857 


SPGAQFLWAAPDMPDPLFSAVQGKDEIIJIKALCE^WrXSKG^E 

PLRJCjIillJjFVTBLSGAHNTTVFQGVAGQSIiQVSCPYDSM 

RKAWCRQLGEKGP<^RWSTHNLWlJiSFI^WNGSTAITDDTLG 

GTIjTI TIjRNLQPHDAGI>YQ CQS IiHGSEADTLRKVLVEVIiAI) PLD 

HRDAGT>LWFPG\DLRASRM?MWSTAS?GASWKEKSPSHPLPSFS 

SWPASPSSRF*QPAPSGIJQP<^RSQGHIHPVNWTVAMTQGISS 

KLCQG 


5373 


2B14 


346 


VKKTK3IFNSAMQEMBVYVENIRRKFGVFNYSPFRTPYTPNSQY 
Q&UjIiDPTNPSA<n7AK^DKQ^ 

TGRRISLSDMPRSPMSTMSSVHTGSDVEQDAEKKATSSHFSASE 
ESMDFTJ?KSTASPASTiat30^SI^SPKPFSP0I^APITTKTD 
KTS TTGS I LNLNLI5RS KAEMDIiKBLS ESVQQQSTPVPLI S PKRQ 
IRSRFQIIJLDKI'IESCKAQLGINEI SEDVYTAVEHSDSEDSEKS 
DSSDSEY1SDDEQKS*GT5QEDTEDKEGC0^^BCEPSAVKKKPKP 
TNPVE IKEELKETSPASEKADPGAVKDKAS PE PE KDFSGKAKPS 
PHPIKl)KLKGKDETDSPTV}iriGXiDSDSE\NELVIDIiGEDHSGRE 
GRKNKKEPKEPSPKQDVVGKTPPSTTVGSH5PPKTPVLTRSSAQ 
TSAAGATATTSTSSTVTVTAPAPAATGSPVl^ 
AVQRSCGTSSTVQQKEITQSPSTSTtTIjVTSTQSSPLVTSSGSM 
STLVSS VNGDLP IGTASADVAADIAKYTSKL\ MDAIKGTM \TEI 
YNDLSKN\TTWKAQLAEDSQGIJIIEIEKIjQWIJIOXJEL\SEMKHN 

leltmaemrqs weqkrdrl laevkicqlbxj^o^avdetkkkqwc 
anfkkeai fyccwnts ycdypcq\oahwpei£\mxsctqsatapq 
\qbadae\vntetlnkssqgsssstqsapsetasa\skeketsa 
ekskesgstldlsgsrbtpssillgsnqgsdhsr\snksswsss 
dekrgs\trsdhk/tpstqhgrsllpgkesragtpfiigtsk 


5374 


2814 


346 


VKKTKSIFNSAMQEI^VYVENIPJiJkFGVFNYSPPRTPYTPNSQY " 

OMI^PTNPSAGTAKlDKQEKVKI^FDMTASPlaL^KPVLSGG 

TGREISI^DMPRSPMSOWSSVHTGSDVF^nABKKATSSOT 

ESMDFI»DKSTASPASTKTGQAGSI»SGS PKPPSPQIiSAPrrTKTD 

KTSTTGSIIMI^DRSKAEMDLKELSESVQQQSTPVPLISPKRQ 

lRSRFQLNIJ)KyifiSaCAQLGlNEISEJWTAVEHSDSKDSEKS 

DSS DS8 YISDDEQKS *GT^EDTPJ>I02GtX^KEPSAVKKKPKP 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
cotre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=*Alanine, C=Cysteine, D-Aspartic Acid, E=» 
Glutamic Acid, P= Phenylalanine, G^Slycine, 
H-Histidine, Is=Isoleucine, K=Lysine, 
L- Leucine, W=Methionine, N=Asparagine , 
poproline, Q-Glutaraine, R^Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=»Tryptophan, Y=Tyrosine, X=Unknown, *-Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TNPVEIKEBUCSa'SPASBKADPGAVICDKASPKPBKDP'SGKAKPS 

PHPIKDKLKGKDETDSPTVHIiGLDSDSE\NELVIDLGEDHSGRB 

GRKNKKEPKEPSPKQDVVGKTPPSTTVGSHSPPETPVLTRSSAQ 

TSAAGATATTSTSSTVT VTAPAP AATGS PVKKQRPLLP KE \TAP 

AVQRSCGTSSTVQQKKITOSPSTSTITLVTSTQSSPLVTSSGSM 

STLVSSVNGDLPIGTASADV7\ADIAKYTSKIi\MDAIKGTM\TEI 

YNDLSKN\™KAQLAEDSQGI^IEIEKliQWI*HQQEL\SEMKHN 

LRLTMABMRQSWEQBIU)IU.lAEVKKQr^a^ 

ANFKKEAIPYCO#rrSYCU)YPCQ\QAHWPEH\MKSCTQSATAPQ 

\QEADAE\VNTETLNKSSQGSSSSTQSAPSETASA\SKKKETSA 

EKSKESGSTI«DL3GSR£TPSSII»IjGSNQGSDHSR\SNKSSWSSS 

DEKRGS \TRSDHN/TPS TQHGRSI»I»PGKBSRAGTPFIX»TSK 


5375 


2907 


1.116 


HIPIAEEKPMLKRRCRGPLAMGPAQPRLLSGPSQKSPQTLGKES 
P^RQ<Xn , SVA\QSGAQAPGRAHRCAHCKRHFPGWVA\LWLHTR 
RCQA/RGLPLPCP KCGRRFRHAP FLAliHRQVIIAAATPDWG FACH 
IXXK>S FRG WVALVLHLR^HSAAICAGPPACPKMAR DAFWRRKAAS 
SSILRRCHPSRPRGPRPFICGNCGRSILPTWDQ/LKVAHKRVHV 
SRRP*BRGPPAKVFHGPRPRGPPTGDTPPGPGGDAVDRPF\QCA 
Ca?IOiFRHK\PNLIRSHAACTSGERPH0/CSRECG\KRFTNTOY 
LTS \HRRITHTARQPYPCKECGRRFRHKPNLLSHSKIHKRSEGS 
AQAAPGPGSPQLPAGPQESAAEPTPAVPLKPAQEPPPGAPPEHP 
QDP I RAP PS L YS CDDOGRS ?RLERFliRAH0RQf fTGERP FTCAEC 
GKNFGKKTKLVAHSRVHSGBRP FRLARKCGRRFL PRASQSGGRN 
SABPWAPRFGPFVCPDCGKAFRHKPYLAAHRPIATPAKKPYVCP 
DCRKAFSQKSNIj\ VSHRR IHTGER PYACPDCDRS FSQKSNL ITH 
RK5HIRDGAFCCAlCGQTFDDBERI»IiAHQKKHDV 


5376 


4504 


591 


VSTFSLCLWPAGGGGRGRVSNMAQS KRHVYSRTPSGSRM&AEAS 
AR P LRVGSR V E V 1 G KGH RGT VAY VG ATLFATGKWVGVI LDEAKG 
KtnXJTVQGRKYFTOTEGHGIFVRQSQIQVFEIXSADTTSPETPDS 
SASKVLKREGTDTTAKTS KLRGLKP KKAPTAR KTTTRRP KPTRP 
ASTGVAGASSSLGPSGSASAGEIiSSSEPSTPAQTPLAAPI X PTP 
VLTSPGAVPPLPSPSKEBEGLRAQVRDIjEEKLETIiRJLK^ 
KLKELEKHKIQI^QVQEHKSKMQEQ<^LQRRLKEARKEAXEAI j 
EAKKR YMEEMADT ADAI EMATIjDKEMAEKRABSIiQQEVEAIjKER 
VDELTTDI^IiKAEIEBKGSDGAASSYQLKQLEEQNARLKDALV 
RMRDI>SSSBKQEHVK\IiQKlMEKKNQBliBVTO^ 
AESTIDELKEQVBAAIiGAEEMVEMLTD^ 
DIiEAMNEM^BLQENARETKLELREQLDMAGARVRE 
QBTVADYQQTI KKYRQLTAHLQDVNRELTNQQBAEVKRQQQPPP 
ETFDFKI KFAETKAHAKA I EMELRQMEVAQANRflMS LLTAFMPD 
SFLRPGGDHDCVLVLIiLMPRIiICKAELIRKQAQEKFKLSENCSE 
RPGLRGAAGEQLSFAAIGLVY\SLMPAAGHRYHRY* CHALSQCR 
LD\VYKKVGSLYPEMSAHERSIjDFTjIEIJ^ 
KAIKYYQHLYS IHIJ^PEDCTMQI^HIKFIXJSALDCMS VEVG 
RLRAFLQGGQKATD lALLLRDLETS CS \ DIRQFCKKIRRRMPGT 
DAPGI PAAiAFGPQVSDTUiDC3lKHLTWVVAVl^EVAAAAAQLI 
APIiAENEG LLVAAIiEEIiAFKAS EQ IYGTPSSSP YECLRQS CNI L 
ISTl^K\LVTAMQEGBYDAERPPSKPPP\VELRAAAIiRAEITDA 
EGIX3UCUBDRBTVI KBLKKSLKlKGEEItSEANVRLTIjIJSKKIJ)S 
AAIODADER IEKVO^RIaEETQAL[^KEKEFEETMDAIXJADIDQI> 
EAEKAELKQRLNSQSKRTIEGLRGPPPSGIATLVSGIAGEEQQR 
GAIPGQAPGSVPGPGLVKDSPLI^jQQISAMRLHISQLQHENS I L 
KGAQMKASIJ^LPPUIVAKI^HEGPGSELPAGAI.YRKTSQLLET 
LNQLSTHTHWDITRTSPAAKS PSAQLMEQVAQLKSLSDTVEKL 
KDEVLKBTVSQR PGATVTlT>FATFPSSAFLRAKEECK?DI>rvyWG 
KVTFS CAAGFGQRHRLVLTQE QliHQLHS RLIS 


5377 


762 


1106 


DVPCKRVLPAEAQEKGQLTLSCGESGEEG\F*YHEVRQAEGES * 

/WFGPNVRI.VHTQLKTKKPSGTTjKAKFYI)HTC 

SS*WPGYDGWWGGQYIFIFRGMRWEBQP 


537B 


2009 


664 


QASGTTIiRPIiPDI,PQLKRREAT3 RNRALKPRGRIiVLMTSCLPAI* 
RFIATPRI^AMPHID^^3VKIJ)F^CDVLI>RPKRSTLKBRSBVDI J TR 
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SEQ 
ID 
»0: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted^ end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(ArAlanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H^Histidine, I=Isoleueine, K=Lysine, 
L-Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine , ^Threonine , V=Val ine , 
W-Tryptopnan, Y-Tyronine, X=Dnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\-poseible nucleotide insertion} 






t 


S FS FRNSKtfTYSGVP 1 1 AA1WDTVGTFS4AKVLCKS * V PGS F WD 

V PQMGCVFLI YKLPTLKWKMLLLS VLL PAS I LVAEKFSLFTAVH 

KHYSLVQWQBFAGQNPDCLBHLAASSGTGSSDFKQLEQILBA1P 

QVKYICLDVANGYSEHFVEFVKDVRKRFPQHTIM^ 

B EL I LSGADI I KVG I G PGSVCTTRKKTGVG YPQIjSAVME CADAA 

HGLKGHI I SIX^C5 CPGDVAKAFGAGADFVMLGGMIAGHSESGG 

KLIBRDGKKYKLFYGMSS*I \ AM \ KKYAGGVAE YRASEGKTVEV 

PFKGDVEfn*IRDILGGIllSTCTYVGftAKLKBLSRRTTFI RVTQQ 

VNPIPSEAC 


5379 


2009 


, 664 


QASGTTLR PLPDLPQ LKRRKATS RNRALKPRGRLVLMTS CLGPAL 
tjOTa tdd rci udut nwranrr.n v nrrvtfr.r .d p in? q tt xt q p c xnmci .*pp 

He XJ\X Irtil^JnS^trtlXlJpiMJv tvL**/* iVlJ VliijrNXrivtvo JL uJt\Oit£?Ei VL/JLi .Lit. 

SPSFRNSKQTYSt3VPr:DtoNMDTVGTFBM^ 

VPQMGCVFLI YKIiFTLKHKMLIiLSVIiLPAS I LVAEKFSLFTAVH 

IOIYSLVQWQEFT^GQNPDCIjEHLAASSGTGSSDFEQLEQILEAIP 

QVTCYICLDVANGYSEHFVBFVKDVRKRFPQHTrMAGNVVTGKMV 

EELILSGADI I KVGIGPGSVCTTPJCKTGVGYPQLSAVMECADAA 

HGLKGHI I SDGGCS CPGT3VAKAFGAGADPVMUX34LAGHSESGG 

HLIERDGKKYKLFYGMSS * I\AM\KKYAGGVAEYRASEGKTVEV 

PFKGDVBOTIRDILGGIRCTCTYVGAWQj^ 

VNPIFSEAC 


5380 


2 


2050 


PSRAGGAERGRAAAARS PGGSAAGWECPSVLDEAGACTMSSCVS 
SQPSSNRAAPQDB3iGGRGSSSSESQKPCEAI»RGLSSI^IHLG»rc 
SP IVVTECEPG^VDIXilARDRPIiEADGQEVPLDTSGSO^RPHL 
SGRKI^l^ER5QGGIiAAGGSLDMNGRCICPSLPYSPVSSPQSSP 
RLPRRPTVESHHVS ITGMQDCVQLNQYTLKDEIGKGSYGWKLA 
YNBNDNTYYAt^KVLS KKKXiIRQAAF PRRPPPRGTRPAPGGCIQP 

F\ELVNQGPVMEVPTLKPLSEDCJ\RFYFQDLIKGIE^fLHYQKl I 
H\RD IKPSNLLVGEDGHI KIADPGVSNE FKGSnALIiSMTVGTPA 
FMAPESLSETRKIFSGKALDVWAMGVTLYCFVFG*CPFMDERIM 
CLHSKIKSQALEFPDQPD I AEDLKDLITRMLDKNPESR I WPE I 
KIjaPWVTRHGASPLPSEDBNCTLVEVTEEEVENSVKHI PSLATV 
ILVXTMIT*KRSFt^PFEGSRREERSL8APG3nA 
SBLKT*iaSPLPACCKVT* EFPHPSGC2RPSCWQPPFI*HTHSQPR 
*PEPPRTDRALCPYETGRTCWAPLLQVLWWVGTPLPFPLSTSWL 
PDLVGAPGSHFCFLNIAL^RYNSHTM 


S3B1 


2 


2050 


PSRAGGAERGRAAAARSPGGSAAGWE CPS VLDEAGACTMSS CVS 
SQP^SNRAAPQDEUM^SSSSESQKPCBAtaGLSSLSlRaX^ 
SFIVVTECBPGCAVDIX5IAPJ)PJPLB^ 

SGPJO»SIiQBRS0<3GIAAGGSLDMircRCICPSLPYBPVSS PQSSP 
RLPRRPTVESHHVS I TGMQDCVQLNQ YTLKDB IGKGSYGWKXA 
YNEI^NTYYAMK\njSKKKIjIROAAFPRRPPPRGTRPAPGGCrQP 
RGPI VeQVYQE IA\ ILID^I.DHPNVV\KLVEVL\DDPNEI)HLYMV 
F\BLVNQGFVMEVPTIiKPI#SEDQARFYF^ I 
H\RD I KPSNLLVGEDGH I KI ADFGVSNE FKGSQALLSTjJTVGTPA 
FMAPES1^ETRKIFSGKALDVWAMGVTLYCFVFG*CPFNDERIM 
CLHSKIKSQAL£FPDQPDIAEDIJa^LrTRMIJ)KN?BSRIVV^ 
KLHPWVTRHGAB PLPSEDENCTLVEVTEEEVEHS VKH I PSLATV 
ILV1HT4IRKRSFGNPFEGSRREERSLSAPGNLLTKKPTRBCBSL 
SBLKT* KI SPLPACCKVT* EFPHPSGCRPSCHQPPFLHTHSQPR 
♦PEPPRTOEALC^YETGRTCmPKLQVLWWVGTPLPFPLSTSWL 
PDLVGAPGSHFCPTiNIALLRYNSHTM - 


53B2 


153 6 


203 


GARGSQQnAPALQBABVRGPERAQPARGROTKARLFRLWLVLGS 

VFMILLIIVYWDSAGAAHFYIiHTSPSRPHTGPPLPTPGPDRDRE 

LTADSDVDEFII)KFLSAGVKQSI)LPRKETEQPPAPGSMEESVRG 

YDWSPRDARRSPDC^QGAEPJ*SVIJtGFCANS^ 

DIPNSEl^HLIVDDP^GAIYCYVPKVACI^KRVM^ 

P^FHlDPraiPREHVHNASAHLTFHKFW^^ 

KKYTKFI*FVRDP FVRLISAFRSKFBLENEEF/ * PQVRRAHAAAV 

RQPHQPARLGARGLPRWPQ\VSPANPIQYIJJDPK^3KLAPFNEH 

WRQ VYRLCHPCQ I D YDFVGKLETLDEDAAQIXQLIjQVDIAA^ 
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ID 

NO: 


rXcCuCCSfi 

beginning 

mif 1 Ant* { A** 

location 
cor r c spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=AspartiC Acid, E» 
Glutamic Acid, F=Phenylalarine, G=Glycine, 
H=Histidine, l=*Isoleucine, K= Lysine, 
L=Leucine, M=Methionine, N=*Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V»Valine, 
W=Tryptophan r Y»Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








PEI^PGTGPPSSWBEDWFAKIPIJ^QQLYKI.YKADFVLFGVTKP"" 
BNLLRD 


5383 


45 


5250 


VERIJ^CRNSKRTWHMLISK^PWRRLQGISFGMYSAKEIjKXLS 
VK5ITNPRYIJ>SLGNPSANGLYDLAIiGPADSKB\rcSTCVQDFSN 
C$GHI£HIELPI»TVYNPIiDFDKLYI^^ 

HLIiLCQLRVLE VGALQAVYEI»KRI LSRFLE ENADPSASE I REEL 
EQYTTEl^CHflNIjLGSQaAHVKNVCBSK^ 

PHCXTGRSVVRKEHNSKIiTITPPAMVHRTAGQKDSEPIjGIEEAO 

IGKi^yTjTPTSARBHLSAIjWKNEGFFI^JYLFSGMDDDGMBSRFN 

PSVFFLDFLVVPPSRSRPVSOXSIXJMFTNGO^^ 

IRKIJJVLMAQEQKXPEEVATPTTDEEKDSLIAJDR5PI^TLPGQ 

SLIDKXjYKIKIRLQSHVNIVFDSEMDKIjMMDKYPGIRQIIiEKKB 

QPVTP WNVQEIjKQAVIHGPNVHPGASMVINKDGS RTALS AVDHT 
QREAVAKQIiLTPATGAPKPQGTK I VCRHVKNGD I LLLNRQPTLH 
RPSIQAHRARZLPEEKVlJil^ANCKAY^^PIXSDEMNAHFPQS 
ElX3PJ\EAYVIACnXXiYLVPK^^ 

CTPTREHYMBLVYRGLTDKVGRVKLLSPS ILKPFPLWTGKQW3 
TLLINI I PBDHI PLNLSGKAKITGKAWVKBTPR3 VPGFNPDSMC 
ESQVI I REGE LIX3CWLDKAHYGS S AYGLVHCCYE I YGGETSGKV 
LTCLARLFTAYLQLYRGFTL3VED I LVKPKADVKRQRI I EESTB 
CX5PQAVRAAiNLPEAA5TO5^GKWQl)AHLGKDQRDFNMI DLKF 




• 




MQISCIiTjGQIELEGRSTPLMASGKSLPCFEPYEFTPRAGGFVTG 
RFLTGIKPPEFFFHC^GREGLVDTAVKTSRSGYLQRCIIKHLE 
GLWQYDLTVRDSDGSWQFLyGEDGLDIPKTQFLQPKQFPFLA 
SNYEVIMKBQHLHEVIjSRADPKlQu^IfflFRAIKKWQSKHPNTLI^R 
RGAFLSYSQKIQEAVKALKLE3ENRNGR/RPWDS /G/RMLRMWY 
ELDEE SRRKYQKKAAAC P D PSLSVWRPDIY FAS VS ETFETKVDD 
YSQBWAAQTEKSYEKSELSLDRXRTLLQL\KWQRS LCEPGEAVG 
LLAAQS XGBPSTGilTI^^FHFAGRGEMNVTEXjIPRLREIIjMVAS 
ANIKTPMMSVPVLNTKKAlJCRVKSLKKQIiTRVCljGE DVQ 
ESFCMEEKQNKFQ VYQLR FOFLPHAYYQQEKCLRPEDI LRFMBT 
RPFlQ^ESIKKKNNKASAFRMNm^ 

EQEGDEEEBGKIVDAEAEEGDADASDAKRKBKQEEEVDYBSEEB 

EEREGEENDDBDf^EBRNPHREGARKTQEQDEEVGL/GH*GGPV 

PSRPPDAAPETHPQPGAPGA\EAMERRVQAVREIHPFIDDYQYD 

TEESLWC^VTVKLPI^INFDMSSLVVSIiAHGAVIYA 

LLNETTNNKNE KELVLNTEG INLPEL fky ab vldlrrlysnd ih 

AlANTYGIEAALRVIBKEIKDVFAVYGiLAVDPR^ 

F^SVYKPLNRFGIR^SSPLQXJMTFKFSF 

SPSACLWGKWRGGTGLPELKQPLR 


5384 ' 


196 


POD 


UyCGQRI*PTVL*I**GPPGSCPCXIjSI*F\PGRPH^ 
TtljKGDKGDPXSPMGLPGYMGREGPQGBPGPQGSKGDKGEMGSPG 
APCQKRFFAFSVGRKTALHSGEDFQTlJbFERVFVNLDGCFlD^ 
GQFAAPLRG I YFFSLWVHS WOTKETYVHIMHNQKK AVI L YAQPS 

SGHUKAEDD 


5385 ~ 


326 


799 


LMVPRTKKEAPAPPKAEAJKAKAL\KAKKAVLK^ 
SPTFRI^Kl!L*I^QPKyPWKSTPRl?NKI»DHHVI I KFPLTTE* A 
VXKI KNNS LbVFTATOVKANKHQI KQAVKK/ LOT T Q 
SDGERKAY VR LA PDYDALWATKIGIT 


5386 


326 


799 " " 


LMVPRTKJU^PAPPKAEAKAEAL\KAKKAVLKDV^ 

S PTFRRP KTL * LRRQPKYPWKSTPRRNKLDHHVI I KFPLTTE * A 

VKKI ENNSIJ, VFTVDVKANKHQ I KQAVKK/ LCD IDVAKVNTL IQ 

SDGERKAYVRIiAPDYDALWATKIGIT 


5387 


2 


2117 


FWAASGGCW FVLGERRAGSIiLSAS YGTFAM PGMVLFGRRWA'IA 
SDDLVFPGFFELVVRVr.WWIGILTLYLMHRGKLDCAGGALLSSY 
L I VLM I LIiAWICTVSAI MCVSMRGTI CNPG PRKSM SKLLYTRL 
ALFFPEMVWAS I/5AA WADGVQCDRTVVNG I IAT7VVS W I IIAA 
TWS IIIVFDPIX3GKMAPYSSAGPSHIJ)SHDSSQIJ^GLK^^T 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
correspondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E~ 
Glutamic Acid, ^Phenylalanine, G-Glycine, 
H=Histidine, I-Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, H-Asparagine , 
P=Proline, Q^Glutamine, R^Arginine, 
S=Serine, T=Threonxne, V=Valine, 
^Tryptophan, Y-Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\apossible nucleotide insertion) 




- 


- 


SVWBTHIKIjLCCCIGKDDHTRVAPSSTAELFS , rYPSDTDIfVP9D 
IAAGLAIiLHQQQDNIRNNQBPAQWCHAPGSSQEADLDAELKNC 
HHYMOFAAAAYGWPLYI YRNPI/TGIiCR TCk^nrrR^KTvipnTMr /m 
VGGDQLQIi/ CTSAP ILHTHRAAVQGLHPRQLPWTRFTELPFLVA 
IiDHRKBSVVVAYRGTMSLQDVIjTDIiSAfiSBV 
KGISQAARYVYQRLINDGILSQAFSIAPKYRLVIVGHSLGGGAA 
AI^TMVI^YPQTOCYAFSPPIWLWSKAI^SQSFIVSLVLG 
KDVIPRLSVTNLBDLKRRILRV^ 

GNPNNLPTELDGGDQEVLTQPLLGKQSLIiTRHSPAYSFSSDS PL 
DSSPKYPPLYPPGRT TCn*nT?RGft<tt?R WtrY**;Ji MrvQ&WMcwiytti? 
FSK I L I GPKMLTDHM PD I LMRALDS WSDRAACVS CPAQGVS S V 
DVA 


5388 


1569 


753 


TADGGAGGGGPJiQAGVPJUr^LYPFTGGYRRRRAACOARRPAARS 
KDTDLAAYQKGNLGVQLRNMAQE TNHSQVPMLCS TTOGFYGN PR 
TNGMCS VCYKKHLQRQNSSNGR ISP PVQCTDGS VPEAQ SALDST 
SSSMQPSPVSNQSLLSBSYASSQU>STSVDKAVPETEDVQASVS 

TVVYTVT©4YTIALTITK^LKNFVFQQEFKSFGS FHQQI»LE YK 
ILBHLQTKN 


5389 


1569 


753 


TADGGAGGGGRRQAGVRRHY LYP FTGG YPJiRRAAC!QAERPAARS 
KDTDIiAAYQKGNLGVQLRKMAQKTNHSQVP 
XT^CSVCYKKHliQRG^SSl^RISPPVOCroGSVPSA^ 
SSSMQPSPVSNQSLl^BSVASSQLDSTSVDKAVPETBDVQASVS 
OTACXJPSEBQS KSIiE\NIWKKRIAV5CA 

TVVYTVTQMYT1ALTITKQMLKNFVFQQEFKSFGSFH0QLLEYK 

X UiUl JL>v L AH 


5390 


217 


1332 


EDPRKI^EDKMWSBCEGPP^SLVCXTDFG^UlAREQLSkSTRbFI 
EGGADDSITRDDNIATVFKRIPJLRPRYIiRDVSRVDTRITIQGEEI 
SAP I C IAPTGFHC1>VWPDGKMSTARAAQAA\G I CYI TSTFAS CS 
IJ2DI VIAAPEGUWFQLYVHPDI^LNKQ^ 
LDTPVCGNRRHDIRNQI.RRNIiTriTDLQSPKKGNAIPYFX5MTPlS 
TSLCHNDLSWFQS 1TRLP I I LKG IXTKKDAKIAVKHNVQGI I VS 
NHGGRQLDEV1AS 1 UMjTEVVAAVKGKIEVYLDGGVRTGNDVIiK 
ALALGAKCIFLGDAI LWAIAS KGKHGVKKVLN I I»TNEFHTSMA\ 
LTGCRS VAE INRNLVQFS RL 


5391 


1 


1292 


vivnn/iunonurf j. f\\ri^r\^n \^xj DU\ir\j l vluKKuuVHAn VixJcSJc4KUc»c 

QPPVQIKIjMHQBQI»KVMFVGX3PT3TRKDY^ 
WjRVLE^KHRDWIRQGE I FLLPARVPHSPORFANTVGLVVER 
RRIiETEIiDGraYYVGDIMDVL IQEFFS 
SB0Y^TGKPXPIX)I*LKEPPFPLSTRSIMEP14SIiDAWIJ>SHHRK^ 
QACTPI£LFGDTYF/rQV:iAYGCCSS 

VlT^PJlLSLGPWMDSIJjVr^WGPSY \AW\ERTQGS VALSVT\Q 
DPACKKS P^GEPSaiGLKAATGVPSTLEVPSLPNNSPS PHYIiSV 
YCRCVPHRPAHCCMPPSCPSQPRCHAPGKAAAPHLLWOTO 
PVLPGOliPPAPIJ^PI PI^LQTQCSTSTPRRPSIKAS 


5392 


1 


1623 


IRGSNAQKVVGASGSGGAGPQPDPAGPGG\^AtAAAVLGACEPR 

CAAPCPLPALSRCRGAGSRGSRGGRGAAGSGDAATVAAEWIRKGS 

FIHKPAHGWLHPOARVI/3PGVSY 

ROXiVTREAJlJRIJIKAVPGVRGSWKKKAPNK^^ 

GMS I S IHISTDGLS LSVPATRQVT ANHHM PSI SFASGGDTDMTD 

YVAYVAKDP INQRACH I LECCEGL\AQS I ISTVGQAFELRFKQY 

LHSPPKVALPPERLAGPEESAWGDEEDSLEHNYYNSIPGKEPPL 

GGLVDSRIiALTQPCALTALDQGPSPSLPJ^AC^LPWDVGSTGTAP 

PGDGYVQADARGPPDHEEHLYVNTQGLDAPKPETDSPKKnriFDMR 

PFEDALKLHECS VAAGVTAAPLPIjEDQWPS PPTRRAP VAPTEEQ 

LRQEPKXTIGRMSPJLAABRMLRADGDFLVRBSVTNPGO^ 

AGQPKIfl^JjVDPEGVVRTKDVLFESISHLIDHHIXJircQPIVAAE 

SELHLRGWSREP 


5393 


2 


982 


GGDSAGMTMElX}MSQNVCTRNLWIJjQPLTV^ 
PKAVLIO.EPPWlNVLQ\EDSVTLTCQGAPQi 3 /KRSDSIQWFlING 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F= Phenyl alanine, G "Glycine, 
H-Histidine, I*»Isoleucine, K-Lysine, 
L«Leucine, M=Methionine, N=Asparagine, 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=VaXine, 
W=»Tryptophan, Y=Tyrosine, X=UnJcnown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=posslble nucleotide insertion) 








\NLI PTHTQPS\YRFKA1^\DSGBYTCXJTGQTSIj\SDPVHLTV 
LSEWLVLQTPHLBFQEGBTIMIiRCHS \WRDKP\ LVKVTFFQNGK 
SQKFSHIiDPTFS IPQANHSHSGDYHCTGNIGyTLFSSKPVTITV 
QVPSMGSSSPMGIIVAWIATAVAAIVAAWALIYCRKKRISAN 
STDPVKAAQPEP PBRO^IAIRKRQliKBTNWDYETADGGYMTLHP 
RAPTDDDKNIYLTLPPNDHVNSNN 


S394 


2 


982 


GGD&AGMTMETQMSQjNVCPRNLW 

PKAVWa^PPWINVLQ\EDSVTLTCQOAPQP/BRSDSIQWEHire 
\NIiIPTHTQPS \ YllPKA^ffilN\DSGE YTOQTGO/TSIi\SDPVHLTV 
LSEWLVKJTPHIiEFQEGBTIMLRCHS \WRDKP\LVKVTFFQNGK 
SQKFSHLDPTFSIPQANHSHSGDYHCTGNIGYTLFSSKPVTI^ 
QVPSMGSSSPMGIIVAWIATAVAAXVAAWALIYCRKKRISAN 
STD PVKAAQ FE PPGRQM IAI RKRQLEETNNDY ETADGGYMTLNP 
RAPTDDDKNIYLTLPPNDHVNSNN 


5395 


3135 


531 


RASDAKNQEGLLNTRRKSTDSVPISKST^ 
SSGNPEAVALAPDAYSTGSSSASSTLKRTKKPRPPSU^^ 
KPTETP PVKBTQQEPDEBSLVPSGENLASETKTESAKTEQPS PA 
LI^ETPLEPAAGPKAACPLDSESVEGVVPPASGGGRVQNSPPVG 
RKTLPLTTAPEAGEVTPS DSG6QEDS PAKGHS VRLBPDYSEDKS 
SWDNQQENPPPTKKIGOPVAKMPI^PKMKKTPBKIJDNTPASP 
PRS PAEPNDI PIAKGTYTKD I DKWDDPNFNPF5STSKMQBS PKli 
P<X>SYNPDPDTCDESVDPFKTSSKTPSSPSKSPASFBIPASAME 
ANGVDGDGLNKPAKKKKTPLI-CTDT FR VKKSPKRSPLSDP P 3QDP 
TPAATPETPPVISAVVHATDHEKLAVTNQKWTG^^ 
YPOPSDLSTFVNETKFSS PTEELD YENS YE IB YMEKIGSS I»PQD 
DDAPKKQALYLMFDTSQESPVKJS^PVRMSESPTPCSGSSFEETE 
ALWTAAKNQHPVPRGLAPNQESHLQVPEKSSQKELEAMGLGTP 
S EA I EI TAPEGS FASADALLS RLAHPVS LOGALDYT4EPDLAEKN 
PPLFAQKLQRBAAH PTDVS I SKTALYSRIGTAEVEKPAGLIiFQQ 
PDLDSALQ1ARAEI ITKBRE^/SEWKDEYEBSRREVMEMRKI VAE 
YEKT IAQM IEDEQRBKSVS \HQTVQQLVIiEKEQA\ IjADLNS VE K 
\SLADLFRRYEKMKBVIiKGPRKWEBVLKRCAQBYL^ 
YQALKVHA\ KB KLDRANAB\ IAQVRG KAQQEQAAHQASLABRS S 
CRv\DALBRTLEQKNKEIBE1jTKICDELIAKMGKS 


5395 


3135 


531 


RASBAXNQEGLLNTRRKSTDSVPiSk^T^ 

S SGNPBAVAIAPDAYSTGS SSASSTLKRTKKPRPPS LKKKjQTTK 

KPTETPPVKBTQQEPDEBSLVPSGENIiASETKTESAKTEGPS 

LLEBTPLEPAAG PKAACPLDSES VEGWP PASGGGRVQNS PPVG 

RKTLPLTTAPRAGBVTPSJlSGGQEDSPAKGHSra^ 

Sm^QQKNPPPllCKXGKKFVAKMPLRRPKMKKTPEKLDNTPAS ? 

PRSPABPNDIP IAKGTYTFD1DKWDDPNPNPFSSTSKMQESPKI. 

PQQSYNFDPDTCDESVDPFKTSSKTPSSPSKSPASFEIPASAME 

ANGVDGDGLNK^AKKKKTPLKTDOTO 

TPAATPBTPPV I SAVVHATDEEKLAVI^QKWTCMTVDLBAt)KQD 
YPQPSDLSTPVNETKFSSPTEELDYRNSYEIBYMEKIGSSLPQD 
DDAPKKC3AIiYI^!FDTSQBSPVKSSPVRMSBSPTPCSGSSFEETE 
ALVNT»AXNOTPVPRGLAPNOESHIX^ 

SBAIBITAPEGSFASADAIliSRIiAHPVSLaW.DYLEPDIiAEKN 
PPLFAQKLQRRAAHPTDVSISKTALYSRIGTAEVEKPAGLLFQQ 
PDLDSALQIARABI I TKERBVS EVIKDKYEESRREVMEMRICIVAE 
YEKTIAQM I EDBQRSK6V^\HQTVQQLVLEKEQA\LADLNSVEK 

YQALKVHA\BEKLDRANAE\ I AQVRGKAQQEQAAHQASLAERSS 
CRVXuALERTLEQKNKEIEELTKICDELIAKMGKS 


5397 


3135 


531 


RASDAKMQEGLLNTRRJCSTDSVPISKSTLSP^ 

SSGNPEAVALAPDAYSTGSSSASSTI^TKKPRPPSIJCKKO^^ 

KPTETPPVKBTQQEPDBESLVPSGBNLASBTKTESAK^^ 

LLBETPLEPAAGPKAACPLDSESVKGWPPASGGGRVQNSPPVG 

RXTLPLTTAPEAGEVTPSDSGGQEDSPAKGHSVRLBFDYSEDKS 

SWDNQQENPPPTKKIGKKPVAKMPLRRPKM^ 

PRS PABPNDI P IAKGTYTFD IDKWDDPTfFNPFSSTSKMQESPKL 
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trjcfc^uicuecx 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


cc cui cueu end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine , C-Cysteine, D-Asparfcic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^GXycine, 
H=Histidine, I=Isoleucine, K»Lysine, 
L=Leucine, M=Methionine, N=Aeparagine , 
P=Proline, Q=*Glutamine, R«Arginine r 
S=Serine, T=Threonine, V^Valine, 
VMryptopban, Y^Tyrosine, X ^Unknown, *-Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PQOSYNFDPDTCDBSVDPFKTSSKlPSSPSlWPASPBIEASAMB 
ANGVIXSIXSIJ^PAKKKKTPLKTDTPRVKKSPKRSPLSDPPSQDP 
TPAATPBTPPVI SAVVHATDEKKIAVTNQKWTCMTVDIiEADKQD 
YPQPSDIiSTFVNETKFSS PTBBLDVRN5 YEIBYMBKIGSSIiPQD 
DDAPKKQALYIMFDTSC^SPVKSSPVRMSESPTPC^GSSFBETB 
ALVNTAAKNQHPVPRGEjAPNQBSHLQVPEKSSQKELEAMGIjGTP 
SBAIEITAPEGSFASADAlASWiAHWSI£GAIJ)yi^DlJffi^ 
PPLFAQKltQRKAAHPTDVSISKTALYSRIGXTuTVEKPAGLLFQQ 
PDLDSALQIARAKIITKEREVSBWKDKyEESI^KVMEMRKIVAE 
YEKTIAQM I EDEQREKS VS \HQTVQQLVLBKEQiA\LADLNSVBK 
\SUu^LFRRYEKMKEVIiEGFRKNEKVLKRCAQBYl^RVKKEEQR 
YQALKVHA\EEKLDRANAE\IAQVRGKAQQEQAAHOAStiABR£S 
CRV\DALERTLBQK»KEIBBbTKICDELIAKMGKS 


5398 


56 


5426 


SGBVCRMESNFNQEGVPRPSYVFSADPIARPSBINFDGIKIiDLS 
HBFSLVAPNTEANS FBSKDYIiQVCLRlRPBTQSBKELBSEGCVH 
ILDSQTVVLKEPQCILGRLSEKSSG\QM\AQKPSFFPGPLGPAT 
TQKF^FQGCI^P\VKDIJ,KGQSRI*IFTYGLTNSGKTYTFO<n , B 
F^IRILPRTLtA^FDSLQFJlIjYTKM^KPHRSRRYliRIjSSEQEK 

f^iasksallrqikevtvhndsddix-ygsltwslnisefeesik: 

DYEQANLNMANS IKFSVWVSFFEI YNKYIYDI.FVPVSSKFQKRK 
MliRLSQDVKGYS FI KDLQWIQVSDSKBAYRLLKLGIKHQSVAFT 
KLNNASS RSHS I FTVK.I LQI EDSQ4SR VIRVSELSLCDLAGSER 
TMKTONEGBRIJlETGNIOTSIiTL^ 

FRESKLTHYF/QSFFNGKGKI CMIVNI SQCYLAYDETIiNVLKFS 
AIAQK^CVPDTI^SSOBKIjFGPVKSSQDVSLDSNSNSKILNVKR 
ATISWENSLEDLMEDEDLVEELENAEETED/VGETKLLDEDLDK 
TliEENKAFI SHEBKRKLLDL lEDLKKKTiINEKKEKLTLEFKI RE 
EVTQEFTOYWAOREADFKETIiLQERE I LEENAERRliAI FKDLVG 
KCDTRE EAAKDI CATKV8TE BATACLELKFNQ I KAEIAKTKGEL 
I KTKEELKKRENESDSLIQEI»ETSWKKI I TQNQRI KEI»INI IDQ 
KEDTCNEFQNIJ<BHMENTFKC^^ 

PXDSKSKI CS3RKRVMENBLQQDBP PAKKGS IHVS3AITEDQKK 
SBEVRPNIABIFJ)IRVLQENNEGLRAFLLTIENBLKNEKEEKAE 
LNKQIVHFQQELSliSBKKNLTLS KBVQQlQSNYDIAIABXiHVOK 
S KMQEQEE KIMKLSNB I ETATRS I TNWSQ IKLMHTKIDFXRTL 
DSVSQ1S& IDLLNLRDI*SNGS EfBDNLPNTQ LDLIiGjND YTj VSKQV 
KEYRIQE PNRBNSFHSS IEAi WEECKBI VKASSKKSHQIEELEQ 
QIBXI<32ffiVK&y]a>EHllRLKERE 

LQRKNVTLDVQIQHVVEGXRAI»S ELTQGVTCYKAK I KBLETILE 
TQKVERSHSAKLEQDILEKES I ILKI*ERNLKBFQHHIjQD8VKNT 
KDUjTVKELKLKEIiSITQLTNNI^ 
LKEBI^ASSARTQN\ia^IjQJlKEEDYADLKB^ 
QKEVSVMRDEDKLLRI BClNELEKKKNQCSQKLDMKQR\TIQQIiK 
EQLINQKVEEAI QQYERACKDIiNVKEKI I EDM RMTLEEQEQTQV 
aijLHjv jj \adiAJjStSVDiLunJ. nLit /tintC V ivLarUijiS L Kiw^KoWKiSixK 

NimJVLGKLTNLQDBLQKS BQK YNADRKKWLEEKMML ITQAKEA 
ENIRNKF^KKYABDRBRFFKQQNBMEILTAQLTEKDSDIiQKWRB 
BPJ3QLVAALEIQLKALISSNVQKDNEIEQLKRiISETSKIETQI 

STENDG^TRFPKPEIiEIQFTPI^PNKMAVKHPGCrrTPVTVK^ 
ARKRKSNEMEEDLVKCKNJCKNATPRTNLKFPISDDRNSSVKKEQ 
KVAIRPSSKE7TYSLRS0AS 1 1GVNLA.TKKKEGTLQKFGDFLQHS 
PS ILQSKAKKI IETMSSSKLSNVBASKEWSQPKRAKRKI.YTSB 
ISSPIDISGQVILMDQKMKESDHQI IKRRLRTKTAK 


5399 


705 


230 


GPRMAKFI^QIXJINF^nCECFSbYDKQQRGKIKATDLMVAMRCLG 
ASPTPGBVQRHWJTHG IDGNGBliDFSTFLTIMHMQIKQEDPKKE 
ILLAMLMVDKEKKGYVMASDI»RS KLTS lJGEKLTHJCEV\ DDI>FRK 
\ADIEP»GK\nCYl)EFIHKIT3YLDGTY 


5400 


931 


248 


SHGSSGNKI PPTNYPASPJU^LVAQNYINYQQGTPHRVFBVQKVK 
OASMEDIPGRGHKYRLKPAVBBI IQKQVKVNCTASVLYPSTGQE 
TAPEVMFTFBGETGKNPDBEDOTFYQRLKSMKBPliBAQNI \PM 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residua of 
amino acid 
sequence 


piredxcted end 

nucleotide 

location 

s*f\y~Y~& c?t"ir>Ti "i fin 
tut JL t£o£J\J JlUJLU^g 

to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(Alanine, C=Cysteine. D=Aspartic Acid, E~ 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 

n~ni.oLxuj.iJc, .L^-LoOJLcVlCXne , J*.— uysxxie , 

L=rLeucine, M=»Methionine, N=Asparagine, 
P»Proline, QsGlutaraine, R^Arginine, 
S=Serine, ^Threonine, V-Valine, 
WoTryptophan, ^Tyrosine, X=Unknown , *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








Q]^DFIKU3YTILUIWIAS0BIIPW0MQVI,miPQYGTKVKHNS 
RLPKEVQLE 


5401 


3 


1360 


TGWSYGPTTSIiAFIxAPRDFPFPPKI^IHPQAVVRLSCXjAGSMGS 
QAAAEWRNWASVnsGSSSIiSGCSM 

KWAPRQDDMLFYVRRKI^YSGSESGM)GRKAAEPEVEVfiVYRRI) 
SKKLPG:iXDPDIDWEESVCLNLIL^ 

HIHKKKSQQVFASPSKHPMDSKGSESKISYPNIFPMIDSP\KE\ 

VFSmTVGKGEMVCVELVASDKTNTFQGVIF 

DmvSVAAP^lAQK\MSFt5FSKYSNM^\W\MraPQGKGHABMA 

VSRVSTGDTS PCGTEEDSSPAS PMHERVTSFSTPPTPERNNRPA 

FFS PSRiKRKVPRlHlIAEMiaCSHSANDSEEFFRE^ 

NIiRSRSIxSGTGRSLVGSHLKLNHADGNF 

LTDILEVRQKPILMT 


5402 


3445 


1563 


GECFlMAAWQQNDiOVFBFASNVMKDKKQLGDPAIPPAVIVEHV 

PGADIIJfS YAGIACVBKPJtfDM I TBSSLDVAEEB I IDDDDDDITL 

TVFJVSCHIXSDETlF/riEAAEALLNM^ 

PEDDMVVAPVTHVSVTLIX5IPEVMETQQVQBKYADSPGA^ 

PICRKKGRKTKPPRPDSPATITNISVKKKNXI)^ . 

AI.LQDKATCPKYIKWTQRBKG I FKtiVDS KPVSPJl»MRKHKNKP\D 

MHTOPMGRAIiRYYYQRGIIiAKVEG(^WQFKEMPKDr.iyiNDB 

DPSSSIESSDPSI^SATSNRl^SRSmreSSPGVRGGATTVI.K 

PGNSKAAKPKDPVEVAQPSBVLRTVQPTQSPYPTQLPRTVHVVO 

PVQAVPEGEAARTSTMQDETLNSSVQSIR\TIGAWQVPVVVSP 

RN<X?\l^TVTLQTVPLTTV'IASTDPSAGTGSQKFrLQAIPSSQP 

^^I^KF^™r ^ QSQKAGSPPSIVIlGPARV\OQVLTSNVQTICNGT 

VSV\ASSPSFS\ATAPVVtLF7^LGSSQliVAHPPGTVITSVXKrQ 

ETK?LTQEVBKKESBDHIJCEOTEKTEQQPQPYVMVVSSSNGFTS 

QVAMKQNELLEPNSF | 


5403 


3445 


1563 


GKCFIMAAWQQMDLVFEFASNVMEDE^QLGDPAIFPAVIVEHV 
PGAD I LNS YAGLACVE E PNDM ITES S LDVAEEEI IDDDDDDITL 
TVBAS CHIXn>ETI ETIBAAFALLNMDS PGPMLDEKRIKNNl FSS 
PBDDMWAPVTHVSVTLDG I PEVMETQQVQEKYADSPGASSPEQ 
PKRKKGRKTKPPRPDSPATTPNTSVKJOO^KDG 
ALLQDKATCPKY I KWTQREKGIFKLVDSKWSRLWRJCHKNKP\D 
MNYEPMGPJVLRYYYtiRGirxAKVFW 

dpsssiessdpslsssatsnrnqtsrsrvssspgvkggattvlk 
pgnskaakpkdpvevaqps eviirtvqptqspyptqlfrtvhvvq 
pvgavpegbt^arts rmqdetlnssvqs ir\tiqaptqvpvvvs p 
rho^\lhtvtlqtvplttviastdpsagtg^kfiloaipssqp 

MTVIJCENVMI^QKAGSPPSIVIjGPARV\QC^^ 

V£»V \£U*k>Jr£>i?£> \/\.L>\lr V V 1 Jjr ljlA^ayjjV-ftnjVVvjl V J, XoV-LiviU 

ETKTLTQEVEKJCES ElfflLKEinrEKTEQQPQPYVMVVSSSNGFTS 
QVAMKQNELLEPNSF 


5404 


1B7 


1111 


LPVTLI FAKM^ri^STLIiLIoLLVPLIKPAPPTQQDSRI IYDYGT 
DNFEES I FSQDYEDKYIJX5KNI KFJCEWI IPNEKSLQLQKDEAI 
TPLPPKKENDFMPTCLI^CLSGSVYCBEVDII^VpPLPKKSAY 
I/J£ARFNKIKKLT\AKI)FADIP^ 

SLVEBLSIiABl^LLia.PVLPPKI.TIiFXIAICYWKIKSRGI 

lTT.Wi\iT/FirT»VTX<HMAT J?C\7DrJ4T.PKQr..WV^ 

IVLlF1.lv Jj J. r U I J_»i-aui/^ 1 i'ioV rijii Julr dc>jlm\ V XHljUCiwf Iiwl J-JL/JVi. V 

CKANDTSYIRBR1EEIRI^X?NPIVLGKHPNSPICLKRLPIGSYF 


5405 


2199 


1220 


QNSRSUIMDPC^QHGSGSSLWIG^PSLDSRPRLDYEREIQPTA 
IIiEU3QIKA1RGSNEYTEGPSVVKRPAPRTAPRQBKHERTHE1 I 

NSS ASS EO<5Ij1XRS PPTRPVPGHRSERA1RTQPKQL I VX)DLKGS 
LKEDLTQHKFI CEO^KCKCGECTAPRTLPSCIJ^CNP^CLC^AE 
SMVEYGTCMCIi\VKG I F YHCSNDDEX3DS YSDNPCSCSQSHCX23R 
YLO^GAMSLPLPCLLCYPPAKGCLKLC^RRCn) 
NTVYCKLESCPSRGQGKPS 


5406 


279 


2732 


rwrtynvegpltfmdvaief<^ewc^dtaqc^y 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


amino acid segment containing signal peptide 
f A^Alanine , C=Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine , G=Glycine, 
H=Histidine, I»Isoleucine, K^Lysine, 
Ii=Leucine, M=Methionine , N«Asparagine , 
P= Proline , Q^Glutamine, R-Arginine , 
S= Serine, Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








RNLVPLG/ X IAVSKPDLITCLEQEKEPWEPMRR11EMVAKPPVMC 
SHTOJDFttPEQHIKDPFQKATIiRRYKNCEIJKNVHljKKDHKSVDK 
CKVHRGGYNGPWQCIiPATQSKIPIjFDKCVKAFHKPSNSWRHKIS 
HTEKKXiFKCKECXJKSFCMLSHLAQE^^ 

NCPS I ITKHl0iIbnX5EKPYTCEBCGKVFNWSSRLTTHKKNYTRY 
KLYXCEECGKAFNKSS ILTTHKI IRTGEKFYKCKECAKAFNQSS 
^TEHKKIH^fiKPYKCEEOGKAFNWPSTLTKHK^HTGEKPYT 
CEECGKAFNQPSNLTTHKR rHTA\EKFYKCTEC3GEAFSRS \SNL 
TKHKEIHTEKKPYKCBKCGKAFKWSSKLTBHKLTHTCEKPY1CCE 
KC^KAFNCPSIITKHNRINTCBKPYTCEECGKVFNWSSRIjTTHK 
KNYTRYKLYKCEECX3KAPNKSSILTTHKKIHIEKKFYKCEECGK 
AFKWSSKLTKHKITHTGEKPYXOBBCKKAFNHFSILTKHKRTHT 
GBKPYKCEE CXJlQVFTQSSNUrrHKKXHTGB KF YKCEECX5KAFTQ 
SSNLTTH KKIHTGGKP YKCBECGKAFNQFS TLTKHKI IHTEEKP 
YKCEECGKAFKWSSTLTKHKI IHTGEKPYKCBBCG\KAFKLSST 
LSTHKI IHTGEKPYKCEKCGKAFNRPSNLIBIIKKII^TGBQPYKC 
EETOKAFNYSSHLimiKRIHTKEQPlXCKBCGKAF^ 
NKIHTGEKLYKPKDVTVI LTTFQTFSNI K 


5407 


3 


659 


RPRRRQSSCCTGWLAGWLJhRAAPRFC^ 

AJIIJ^QGTLAQSIKBNHLVKVYDYQEIX^SVLLTCTUVEAKNITO 
KQGKMIGFLTEDKKKHN^ 

YRMCQNCIBLNAATISGFLFAEIVS I FDLAVGVYFIAGTGMEFR 
QS \RAS DKQTLLP \NDPAPTQPLKDPPJCMIXJYSKLQGN \Q LRRN 


5408 




6128 


QGSKGTCHPQAQQpWDEOVWQEAPSQSBPWGOSQEPPTMPQRIaP 

KARQHTPIaPliGSADYRJRWSVRPQGPHRDPKDSRDAAKRBQGS h 

APRPVPASRGGKTIiCKGYRQAPPGPPAQFQRPICSASPPWASRF 

STPCPGGAVRBDTYPVGTQGVPSIJUiAQGGPQGSWRPLKWKSMP 

RLPTDLDIGGPWFPHYDFERSCWVRAISQEIXJLATCWOAEHCGE 

VRNKDMS WPEEMS FIANSS KI DRHKVPTEKGATGLS NLGNTCFM 

NSS IQCVSNTQPLTQYFI SGRHLYELNRTNPIGMKGHMAKCYGD 

LVQBLWSGTQKNVAPLKLRWT IAKYAPRFNGFQQQDSQELLAFL 

LDGI^DI^VllEKPYA^KDSDGRPDWEVAAEAm3lraiiRRNRS 

rVVDLFHGQIiRSQVKCKTCGHISVRFDPFNFLSLPLPMnSYPIHL 

BITVI KIjDGTTPVRYGLRLNMDEK^ h 

lAEVHGSNIKNFPQDNQKVRI^VSGFLCAFKI PVPVSPISASS P 

TQTDFSSS PSTNEMFTIiTTNGDLPR P I F I PNGMPNTVVPGGTE X 

NFTNGMVNGHMPSLPDSPFTGYI IAVHRKMMRTBLYFLSSQKNR 

PSI^GMPI*rVPCTVHTRKXDI>rI^^ 

AQDCDI«MGYGYPFnJ*WQKDGNSC^<^ 

DRAFIGNAYIAVDWHPTALHI»RYQTSQEliVVDEHBSVBQSRRAQ 

VEPINI>DSCti^FTSEEEIX5ENFJ4YYCSKCKTHCIA 

LPPILIIHLKRPQFVNGRWIK^QKXWCFPRKSFDPSAFL.VPRDP 

ALCCHKPLTPQGDELSEPRrrAREVKKVnAQSSAGEEDVLLSKS 

PSSLSANriSSPKGSPSSSRlCSGTSCPSSKNSSPNSSPRTLGRS 

KGRO?LPQIGSKNKI*SSSKENLDASKF2fGAGQICELADALSRGH 

VLGGSQPELVTPQDHB^AlJaiGFk 

I^BEDSTDDQRBDTRrKPIYl^YAlSCHSGlIjGGGHYVTYA^ 
PNCKWYCYNDSSCKELHPDEIDTDSAYILFYEQQGIDYAQFIiPK 
TDGK3^OTSSMDBDFESDY\BKYCVLQ 


5409 


2745 


6128 


QGSKG^CHPQAtXJPWDEGWQEAPSC^BPWGQSQEPPTMPQRLP 
HARQHTPLPI/5SADYRRVVSWPQGPHRDPKPSRDAAKREQGSI, 
APRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRF 
STPCPGGAVREI>rYPVGTQGVPSLAIiAQGGPQGSWRFLEWKSMP 
RIiPTDIiDIGGP WFPHYDFERS CWVRAISQEBQI»AXCWQAEHGGE 
VRNKIMSWPEE^FIANSSKIDRHKVPTBKGATGIjSNIX?NTCFN 
NSS IQCVSNTQPLTQYFI SGRHLYELNRTNP I GMKGHMAKCYGD 
liVQBI^SGTQKNVAPLKLROT 1AKYAPRFNGFQQ0JDS QELLAFL 
LDGI iHKD LN R VI {EKP YVE LKD S DGRPDWKVAAE AWDN I LLRRNRS 
IVVDI^GQIJ^VKCaCTCGHlSVRFDPFNFI^SLPLPMDSYMHL 
BI TVTKLDGTTPVR YGLR LNMI>EKYTGLKKQI*$DIjCGIiNSBQI Ii 
IAEVHGSNIKNFTQDNQKV^SVSGFX^CAFEJPVPVSPISASSP 
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SEQ 
ID 
NO: 


irttiui c tea 

beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amine acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(At Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F= Phenyl alanine, (^Glycine, 
H=Histidine, I»Isoleucine, K=Lysine, 
L=Leucine, M~Methionine, N=>Asparagine, 
P=Proline, O-Glutamine, R^Arginine, 
S=Serine, T^Threonine, V=Valine, 
w^Tryptophan, Y=Tyrosine, X=tJnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=pos3ible nucleotide insertion) 




• 




TQTDFSSSPSTNKMFTLTTNGDLPRPIFI PNGMPNTWPCGTEK 
NFTNGMVNGHMPSLPDSPFTGYI IAVHRKMMRTELYFLSSQKNR 
PSLFX^PLIVPCTVHTRKKDIiYDAVWIQVSRLAS PLPPQEASNH 
AQDC35DS^K^QYPFTLRWQKI)(a^SCAWCPWYllFCRGCKIDCGH 
DRAF 1 6NAY lAVDmPTAljHtiR YQTSQERVVbKHKS VEQSRRAQ 
VBPINIJJSCLRAPTSEEELGENEMYYCSKCICrHCLATKKLDLTO 
I^PILIIHLKRFQPVI^RWIKSQKIVKFPRBSFDPSAPLVPRDP 
ALCQHKPLTPQGDELSE PRI LARBVKKVDAQSSAQBKDVI,LS KS 
PSSLSANIISSPKGSPSS3RKSGTSCPSSKNSSPHSSPRTLGRS 
KGRLRliPQIGSKNKLSSSKBffliDASIG^ 

VLGOSQPEIjpVTPQDHBVAbANGFliYEHEACGNGCGNGYSNGQI^ 
NHSBEl)STI^REin^IKPIYmiYAISCHSGILGGGHYVTYAKN 
PNCKWYCyNDSSCKmjHPDEIOT'DJ^YILPYEQQGIDYAQPLP^ 
TDGKKMADTSSMDEDFESDY\EKYCVLQ 


5410 


2 


710 


I*RFPGOJUfflVWLAARM0APHKEHLYKLI»VIGD I IKRY 
VHQNPSSHYRATIGVDPAIJWIJJVroPET\nmLOl,WDIAGQERFG 
HhrmVYYEBAMGAFIVPDVTRPATFEAVAKWKNDLDSKLSLPNG 
KPV5VVl»LAmaxX3KDVIiMNNG^ 

ENINIDEASRCLAmnLANECDLMES I E PDWKPHLTSTKVASC 
SG\CAKI IiVGTFAGVW 


S411 


1302 


209 


TGPAAAGRRKALGSFGKPS pvtglraarrrrtrpsapaaps vgc 
GKRRKSDAGAGGERASVRTGSGRRGGRTMAGDSBQWjG^ 
GGEFFLIGVSGGTASGKSSVCAKIVQLIjGQNEVDYRQKQVVIIiS 
QDSFYRVLTSEQKAKALKGQFNFDEPDAFDNKL ILKTLKEITEX5 
KTVQI PVYDFVSHSRKEETVT V YPADVVLPEGI IjAFYSQBR / IR 
DLFQMKLFVDTDADTRLSRRVliKDISERGRDLEQILSSSTIiRFV 
KPA\FBBFCLPPK\KYADVI I PR\GADN\RVPINLI VQHIQ\DI 
LNGGPS\NRQTNGCLNGYTPSRKRQASESSfiRPH 


*412 


3180 


313 


QGISNFFHKBANFWFBVSG YI*ISPLRSPPVDFAfcB»SLMAS PWN 

KMECBSSRFEII1TPVSDKKKKKCSIHKERPQKHSHEIFRDSSLV 

NEQSOlTRRKKnRKKDFQHLISSPIaKKSRICDETAKATSTLKKRK 

KRRYS ALEVDEEAGVTVVLVDKENINNT PEOQ^ 

IBQICLPRK\PKTDKF^\^AJCSH\AHKSEAr^KVREKKNKKHQR 

KAASKESQRA\RIXrLPQjSEFPTQKESWI*SVGPGGEITBI»P\ASA 

HKNKS KKKKJCKSSKREYET\IiAMPEGSQAGREAGTOMQESQPTV 

GU)DETPOIiI^PTHIOCKSKKKKKKKSNHQBFBSLA^ 

EVGADMQES \RPAVGLHGETAGI PAPAYKNKSKKKKKKSNHQEF 

FAVAMPESI^SAYPE^SO^GSEWrVBGSTAI^FKESNSTiaCK 

SKKRKLTSVKRARV3GDDFSVPSKNSBSTLFDSVE 

VKS RPRQKKlt^CIiAS KHVQEAPRLEPANEEHWV^ RY 

LSADSGDADDSDADLGSAVKQLQEFI PNI KDRATST 1 KRMYRDD 

LKRFKEFKAQGVAIKFGKFSVXENKCH^ 

KLLYTDRYPBEKSVI TNLKRR YS FRLHIG \ RNIARPWKLI YYRA 
KKMFDVmYKGHYSEODTKKija*^ 

SVALKFSQISSQPJfRGAWSK5ETRKLIKAVEEVTLKKMSPQBLK 

BVD3KLQENPBSCLS IVREKLYKGISWVEVEAKVO/rcNWMQCKS 

KWTEILTKRMTNGRR I Y YGMNALRAKVSLr ERI»YE IttVKDTNE I 

DWEDLASAIGDVpPSYVQTKFSRiKAVYVPFWQKKTFPBIIDYL 

YETTI^LLKEKLBKMMEKKGTOIO/reA^ 

S EGGGHRKRKRRPRRHAWPTP V I PVLWEAKAGWI I 


5413 


3753 


1304 


RFPAGVAPRRAMANVSKKVS WSGRDRDDEBAAPIjIiRRTARPGGG 

TPIiIiNGAGPGAARQSPRSALFRVGHMSSVICbDDEIiLEP\DMDPP 

HPFPKEIPHNEKLLSLKYF^LDYDNSENQLFLEEERRIIJHTAFR 

TVB I KRWVX CAIjIG I LTGIjVAC FID I WENLAGLKYRVI KGN I D 

KFTBKGGLS FS UjLWATLNAAFVLVGSVIVAP I BPVAAGSGIPQ 

IKCFLNGVKI PHVVRLKTLVI KVSGVILSVVGGLAVGKBGPMIH 

SGSVIAAGISQGRSTSLKRDFKIFEYLRRDTEKRDFVSAGAAAG 

VSAAFGAPVGGVLPSLEEGAS PVJNQPLTWRI FFASMISTPTLNF 

VLSIY^lGNMWDLSSPGIiINFGRFDSBKMAYTIK^IPVFIAMGW 

GGVLGAVFNALNYPnjTMFRIRYIHRPCLQ 

FVLIYSSRDCQPLQGGS^YPIX}LFOu^EYNSMAAAFFtm'EK 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(As=Alanine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G~Glycine, 
H=Histidine, I^leoleucine, K^Iiyaine, 
I»=Leucine, M=Methionine, N=Asparagine, 
P« Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y»Tyrosine, X=Untanown, *«=Stop 
Codon, /-possible nucleotide deletion, 
\~possible nucleotide insertion) 








SVVSU^PPGSYNPLTUXFTIAnfFF^ 
SLLIGAAWGRL?GISI*SYIjTGAAIWaDPGKYAIiMGAAAQLGGIV' 
RMTLSLTVIMMEATSNVTYGFP I MLVW1TAK I VGDVT I EGLYDM 
HIQI^VPPIJIWEAPVTSHSLTiUiEVMSTPVTCI^REKVG\^ V 
DVLSDTA&NHNGFPWEflAIH^ 

FVERSNLGLVQRRIiRLKDPRDAy PRPPP 1QS IHVSQDERECTKD 

&SBFMNPSPYTVPQEASLPRVFKI*FRAIX^ 

^VTRKDIiARYRLGKHGLEBLSIiAQT 




2130 


390 


GVASAWDRA.Ii FS PIXS PTSRVPRT3 PPRCVSTETGRRDRAR VPS 
QWCSVI^ia,PVS(^TSl»ACVRS ILLSPASSPRKVGIVGGTGAR 
AGAAPRDHGRVRl£RRPSSARR>rreTTGQCIiAPRGOQGPRGTRS P 
RSPRSRTRRGCSASPACLP /CRSAI»I VAVLCYINf kliNYMDRFTV 
AGVLPDIEQFFNrGDSSSGIJIOTVPISSYMVIAPVFGYLGDRYN 
RKYLMCGG1AFWSLVTLGSSFI PGEHFWLLLLTRGLVGVGEASY 
STI7VPTLIADIJ?VADQRSRMLS I FYFAlPVGSGLGYIAGSKVKD 
MAGJ^WHWALRVTPGLGVVAVt^FLVVREPPRGAVBRHSDIiPPL 
NPTSWWADLRAIxARNPSFVLSST/SFTAVAFVTGSLALWAPAFLi 
RSRWLGETPPCLPGDSCSSSDSIil PGLITCLTGVLGVGIiGVEI 
SRRI»RHSNPRADPLVCATGLnGSAPFLFLSIACARGSlVATYIF 
IPIGETLIiSMNWArVADILLYWI PTR^STAEAFQIVLSHLLGD 

agspyiiiglisdrij«nwppsfi>srfrai£fslm^^ 
ajvflgtahlh 




693 


2966 


IPPKTKLKLQKHXL'PTLTXNQEQATIFEBVQKLRPRNEQRBNEL 
IISPI^CLPBBKQKIMIHIGRMKQTSQMAAEHIGSELPPSArRF 

STLSNTS KKPS VCE KBALP I SES S PKLLGSS EDLS SDSBSHTiPE 

KPAPLSPOX}AFRj?RANTLSHFPIECQE?PQPARGSPGVSQRKLM 

RYHSVSTETPHBRKDFESKANHLGDSGGTPVKTRRHSWRQQIFL 

RVATPQKACTS 3 SRYEDY SELGELP PRSPLEPVCE3GPFGPPPE 

EKKRTSRRLRHLWQKAILQQI L1»LRMEKENQKI^ASENDLI»NKR 

LKLDYEE I TPCLKE VTTVWEKMDSTPGRS KIKFDME KMHSAVGQ 

GVP\RHHRGBlWKFLAEQE^IiKHQPPSKQQPKDVPyKELIiKOLT 

SOX^HAILIDLGRTFPTHPYFSAQLGAGQLSLYNILKAYSUJJQE 

VGYCQGIiSFVAGIl^UilMSEBEAFKMUCFIjMF^ 

IILQIOMYQLSRLLHDYHRDLYNHLKBHKIGPSLYAAPWFI.TMF 

ASQFPI^FVARVFDMIFliQ/yrEVIF^^ 

LETrVDFIKSTLPNLGLVQMBKTlWQVFEMDXAKOliQAYEVBYH 
VI^KKLIDSSPLSDNURMDKl^KTNSSIiRKQNIJ^IXEQLQVANG 
R I QSIiKAT I FJKKLSS B S BCLKQAMLTLE LER S ALLiQTV BELRR RS 
AKPSDREPECTQPEPTGD 


5416 


27 


4074 


XSQLFCFWGGKAGDI LSGDQDKKQKDPYFVETPYGYQLDLDFLK: 

YVDDlQKGl^IKRI^QKRRKPSVPCPEPRTTSGQQGrWTSTES 

I^SSNSDDNKQCPNFLIARSQVTSTPISKPPPPLETSLPFLTIP 

FJn?QLPPPSPQt>PKHNIJIVTFrKjMETRPJUiEQlSRAT^ 

RR PRLASFGGMGTTS SLPS FVGSGNHNPAKHQZjQNGYQGNGD YG 

SYAPAAPTTSSW3SSIRH3PIiSSGISTPVTNVSPMHW2HIRBQM 

AIALKRLKELEEQVRTI PVLQVKI SVLQEEKRQI»VSQliKNQRAA 

SQINVCGVPJOlSYSAGNAS^tiEG^^SRARJiSGGKLYIPYEEEEME 

TVEQSTQRIKBFRQL\TADMO^LEQKIQDSSCSASSELRENGEC 

RSVAVGAEENMNDIVVYIIPJGSP^CKI^ 

AMLGVMTEAJDKE IKLCXX^I ES LKEKI YRLEVQLRETTHDREMT 
KIjKQEIjQAAGSRKKVDKATMAQPIjVFSKVVEAW 

mm»votcvgt5vetnsvgisc0pe^^ 
vemhbrcagrsx^ctksvsvbvsvx^tgsnteesvi^ltllct 

NIjNLKEVRS IGCGlX^VDWCSPKECJ^RGVNTEAVSQVEAAV 
MAVPKIW5QDTSTDLEQVHQPTNTB 

TSTQTVETRTVAVGEGRVKDINSSTterRS IGVGTLLSGHSGFDR 

PSAVKTKESGVGQININDNYLVGLKMRTIACX;PPQLTVGLT7^SR 

RSVGVGDDPVGESLENPQPC^LGIWTG^ 

TU*AENYSBIxAEAK5EPHSC^GSI^SQLISTI»SSIKSVMKSAST 

EEIJ^PDFQKTSI^KZTGSYLGYTCKOGGLOSGSPLSSQTSQPB 
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location 
corse spond ing 
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amino acid 
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amino acid 
sequence 


Predicted end 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Anu.no acid segment containing signal peptide 
(A^Alanine, (^Cysteine, D=Aspartic Acid, B=i 
Glutamic Aeid, P=Phenyl alanine, G«Glycine, 
H=Histidine, l=Isoleucine, K-Lysine , 
L=Laucine, M=Methionine, N^Asparagine, 
P=Proline, Q^Glntamine, R-Arginine, 
S-Serine, TVThreonine , V^Valine, 
W-Tryptophan , Y^Tyrosine, Unknown, *=»Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QEVgTSI^ KP ISS FPTQEGTLSP VNL^DIX}IAAGL YACTNN 
ESlXKSIMKKKDGNKDSNGAKKKTiQFVGINGGYETTSSDDSSSD 
ESSSSBSDDECDVIEYPLEBEEEEBDBDTRGMAEGHHAVNIEGIi 
KSARVBDEMQVQECEPEKYB I RERYELS E KMLS ACWL LKNT IND 
PKALTSKDMRECLNTI>QHEWFRVSSQKSAI PAMVGDYIAAFEAl 
SPDVLRYVIN1^AIXjNGNTALRTSVSHSNFEIVKLLIJ1M3VCNVD 
HQNKAGYTPIMLAAIJUWEAEXDMRIVBEI»FGa3DV^ 
QTALMLAVSHGRinMVKGLIACGADVNlQDDEGSTAIJ4C^E^ 
HVE I VKLLIiAQPGCKGHLEDNDGSTALS XALEAGHKD IAVTiLYA 
HVN?AKAQS PGTPRLGR KT8 PG PTHRGS FD 


5417 


27 


4074 

* 


KSQnPCFWGGXAGDHiSGDQDKEQKDPYFVErPYGYQLDLDFIiK 
YVT3DIQKGNTIKRLNIQKRRKPSVPCPEPRTTSGQQGIWTSTES 
LSSSNSDDNKQCPNFblARSQVTSTPISKPPPPLETSLPFLTI P 
ENRQLPP PS PQI^KHNI JlVrKTI>METRRRLEQERATMQMTPGB F 
RRPRLAS FGGMGTTSS LPS PVGSGNHNPAKHQLQNG YQGNGDYG 
S YAP AAP TTS SMGSS I RHSPLSSGISTPVTNVS PMHIiQH I REQM 
AIALKRLKELEEQVRTI PVLQVKIS VLQBBKRQLVSQLKNQRAA 
SO INVCX5VRKR3 YSAGNASQLEQLS RARRSGGELY I DYEKEEME 
TVEQSTQR1KEFRQL\TADMQALEQKIQDSS CRASS ELRENGEC 
RSVAVGABENMNDIVVYHRGSRSCKDAAVGTIiVEMRNCGVSVTK 
AMLGVOTBADKEIRLOXXJTIBSLKEKTYR^ 
KLKQKLQAAGS RKKVDKATMAQPLWSKVVEAVVOTRDQMVGSH 
MDLVDTCVGTSVETNSVG ISCQPEO<NKW^ 
VBMHDRC^RSVEMCEKSTOVEVSVCETC 

NLNLKEVRS IGCGDCSVDVTVCS PKE CASRGVNTEAVSQVEAAV 
MAVPRTADQDTS TDL EQVHQFTNTETATIj I ESCTNT CI*S TIiDKQ 
TSTO/TVETRTVAVGEGRVKDINS STKTRS IGVGTLLSGHSGFDR 
PSAVKTKESGVGQININDNYLVGlira^RTIACGPPQIiTVGIjTASR 
RSVGVGDDPVGESliBNPQPQAPLGMMTOLDHYIBRIQKLIAEQQ 
TLLAENYSELAEAfGEPHSQMGSLNSQLlSTLSS INSVMKSAST 
EEIiRMPDFQKTSLGKITGS YLGYTCKCGGLQSGS PLS SQTSQPE 
QEVGTSEGKP I S S UDAFPTQEGTLS PVNLTDDQI AAGL YACTNN 
ES TLKS I MKKKDG1)WE)SNGAKKNLQFVG INGG YETTS SDDS S SD 
ESSSSESDDECDVIEYPLEEEEEEEDEDTRGMAEGHHAVNIBGL 
KSARVED^QYQBCEPEKVEIRBRYEI»SBKMI»SAC^LKNTXOTJ 
PKALTSKDMRFCLNTLQHEWFRVSSQKSAIPAMVGDYIAAFEAI 
SPDVLRYVINLADGNGNTALHYSVSHSNFE IVia^DADVCNVD 
HQNKAGYTPIMLAAIJ^VBAERimrVBEIJPG^ 
QTALMLAVSHGRI DMVKGLLACGADVNTQDDEGSTALMCAS El K3 
HVB I VXIiLLAQPGCNGHLEDNE)G3 TAL S I ALKAGHKDIAVLLYA 
IIVNFAXAQSPGTPRLGRKTSPGPTHSGSFD 


S418 


24 


1133 


SVPHAGGDMBTGAAETiYIXJAiaXSILQHVGlW 
K^FYRIiIiRHPSDRMGFPPGAAOALVLQVFlCTFDHMARQDDKKR 
RQKIaEKKIRRKEEEEAKTVSAAAAEKEPVPVPVQEIEIDSTTEt* 
DGHQEVEKVQPPGPVKEMAHGSQBAEAPGAVAGAAEVPR\EP?I 
LPRIQEQFXJRNPDSYl^AVRE^TWSODYTDLEWVPVPKHWK 
GKQ VSVAJLSSS S I RVAMLEENGERVTjMZGKLTinCINTESSLMS L 
BPGKCVtiVNLSKVGBYWWNAI LEGEBPIDIDK1NKERSMATVDE 
EBO^VLDRLTPDYHQKLO^KPOSHBLKVHEMIiKKGNDAEGSPFR 
GORFDPAMFNI SPGAVQF 


5419 


1395 


259 


GTHPLDPDnVSRTS VQGP LMTMACPGMSDTKES PFIjGPRAAEEG 
SESEA£EAFGRRKSEEEGRP^DTSGFGRSRKHF^^KHPERADA 
KDPASLPQC/I^P/DC^PAQPSSKYCSDIXX^KI^AANRIYEIL 
PQRI QQWQQS PCIAEEHGKKLLER I RREQQSARTRLQEMERRFH 
ELEAI ILRAKQX^WEDEESNEXjDSDDTDIXDIFCVSCGHP INPR 
VALRHMERCYAKYESOrSFGSMYPTRIEGATRLFCDVYNPO^ 
YCI^LQVIjCPEHS RDPKVPADEVCGC PLVRDVFELTC K 
RQCNRHYCWEKI^mAEVDLERVRVWYKI^ 
RAGlLAI^LHOTIQHDPLTTDLRSfiADR 


5420 


117 


1733 


NEAGGACPFTCGGASGRLYLSPRLPRVSVAGCEERPLGWVWVIiGG 
GGFLPARPPRAQRHLGFSKAEQSMEAPDYEVLSVREQLFHERIR 
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SBQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predii cted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^ Alanine, OCysteine, D^Aspartic Acid, E*» 
Glutamic Acid, F=Phenyl alanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M*=Methioniiie, N=Asparagine, 
P=Proline, Q=Glut amine , R=Arginine, 
S=Serine, T-Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine , X»unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECII31TjLFATLYILCHIFLTRFKKPAEFrT\GMMKMPPSTRL/ 

JLmjx&JUI* 1 rl laAl AJJb A v JjLiLi Frbi I X 5N:S VL.LS LPRNYY I QWLNGS 

IilHGI*WNLVFLFSNLSLI FLMP FAYFFTBSEGFAGSRKGVLGRV 

YKTVVMIiMLLTUjVLGMVWVAaAI VDKNKANRESIjYDFWEYYIiP 

YLYSCISFLGVLIJ^VCTPLGrjVWlFSVTC 

QLYCSAFEEAALTRRICMPTSCWLPIVDMEL^ 

LEKRRKASAWQRNZ^YPLAMIXZIiVLTGL^ 

EAAMPR6MQGTSLGQVSF^XI^SPG^VIQVVI»IFYIJWSSVVGP 

YSSPLFRSU*PRWHDTAmmiGNCVC^VI^ 

TK FDLLGDFGR PNWLCNF YI VFIiYNAAP7VGLTl"I>CljVKT FTAAV 

RAKLIRAFGERE 


5421 

* 


117 


1733 


NEAGGACPFKGGASGRLYLSPRLPRVSVAGCEKRPLGWWVLGG 
GGFLPARPPRAQRHLGFSHAEOSMEAPDYKVLSVREQLFHERIR 
KCI I STLIiFATIiYILCHI FTjTRPKKPAEFTT\GMMKMPPSTRLi/ 
LLELCTFTLA I ALGAVLLLPFS IISNEVLLSLPRNYYIQWLNGS 
LIHGLWNLVFLFSNLSIil FIiM PFAYFFTES BG FAGS RKCVLGRV 
YETVVMLM1^TLI,VIX3MVWVASAIVDKNKANRESI J YD 
YLYSCISFI/mJbl^VCTPLGUUWFS^ 

L£KRRXASAWQRNIjGYPI\AMIjCI4LVLTGI^ 

KAAMPRG^K)GTSLGQVSFSKI^SFGAVIGW,IPYLMVSSVVGP 

YSSPLFRSUlPRWHDrAtO^riGNCVCIiVIfiSALPVFSRTLGL 

TRFDLLGDFGRFNWLGNFYIVFIjYNAAF 

RAELIRAFGERE 


5422 


3 


1263 


e03BSLPTWLAGA3RPGlGRKGGAWGGRGGSSPAQVIiLSPGPVF 
KAGOJWWHLSRDQAGVQRCDLGSSQPPPLGFKRFSCLSLPSSWD 
YP^TVLCV^KMBADI*SGPNIDAPRWDQRTFI^RVKHFIiNITDPR 
TVFVSEREtiDKAKVMVEKSRMGWP PGTQVBQI.f .YAKKLYDSAF 
HPDTGBKMNVIGRMSFQLPGGMIITGFMI^FYRTMPAVIFWQW 
NQSFlOUiVNYTNPJJAASPTSVRQMALSYFTATTTAVATAVG 
LTKKAPPLVGRWVPFAAVAAAKCVNI PMMRQQBLIKSICVKDRN 
ENEIGHSRRAAAIG I TQVVTSRITNISAPGMII^PVIHERIjE KLH 
F^QKVKVL/SAPLQVMLSGCFL I FMVP VACGLFPQ KCBkFVS Yk 
BPKLQDTI KAKYGBI*EPYVYFNKGL 


5423 


3186 


905 


GVSMALGBBKABAEASBDTKAQS YGRGSCRERELDI PGPMSGEQ ~~ 
P PRLKAEGGIiIS PVWGAEG I PAPTCWIGTDPGGPSRAKQPQASD 
ANRBPVASRSBPAL3GLPPATMGSGDTJ IiSGESQVKKTKLSSSB 
EPPQTIiSIiPKroiCSGHDADTEDDPSI^ 
LSCLSQWKSVI^PGSAAQPSSCSISASSlX^SIiQGHQBRABPRG 
GSIAKVSSSLEPVVPOE^SSVVGI^PRPQWSPQPVFSGGnASGIi 
GRRRLS FQABYWACVLPnSLPPSPrJRHS PLWNPNKKYEDIjLDYT 
YPI^PGPQLPKKLDSRVPADPVLQDSGVDLDSFSVS PASTLKSP 
in v £> P£l UFPAKA I AJUPrSfc> PRK tr i>LxK.{jrt Pt» KVPUKQGwftniASW 
SQLASTPRAPGSRElARMERREPALRGAKDRIiTIGKHLDMGKPQL 
RTRDRGWPSPRPEREKRTSQSARRPTCTKSRWKSEEBVBSDDBY 
LALPARLTQVSSLVS YLGS I STLVTLPTGDlKGQSPIiEVSDSDG 
PASFPSS SSQSQIjPPGAAT.QGSGDPEGQNPCFLRS fvrahdsag 
EGSEJGSSQALGVSSGLLKTRPSIiPARLDRWPFSDPDVEGQLPRK 
GGEQGKESLVCX:\VKTFC\CQLEEr*ICr7LYiaV\ADVTDHGTPAR 
SNLTSIxK\SSLQLYRQFKJtDIDEHQSLTESVl^KGEILWX^I'E 
NTPVLEDVI jGR IAKQSGELRSHADRLYDS I IiASLI)MLAGCTL I P 
DKKPMAAMEHPCEGV 


5424 


3186 


905 


GVSMALGEEKAEAEASEDTKAQSYGRGSCREKELDIPGPMSGEQ 
PPRIjEAEGGIiISPVWGAEGIPAPTCWIGTDPGGPSRAHQPQASD 
ANREPVAERSEPALSGLPPATMGSGDLLLSGES QVEKTKLSSSE 
EFPQTI>SLPRTTICSGHDADTEDDPSIJU^LPQALDI>SQQPHSSG 
LSCI^SQWKS VLSPGSAAQPSSCS ISASS TGSSIiQGHQBRAEPRG 
GSIAKV^SLBFV\n5QEPSSWG:^ 

GRRRLS FQAKYWACVLPDSLPPS PDRHS PLWNPNKBYEDLLDYT 
YPLRPGPQLPKHiDSRVPADPVLQDSGVDIiDSFSVSPASTCKSP 
TIJVSPNCPPABATALPFSGPRBPSLKQWPSRVPC^QGGMGLASW 
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NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G^Glycine, 
H=Histidine, I*=Isoleucine, K=Lysine, 
L*»Leucine, M»Methionine, tfeAsparagine, 
P=Proline, Q=Glutaraine # R«Arginine # 
S=»Serine, ^Threonine, V=Valine, 
W=Tryptophan, Y»Tymsine, X=Vnknown , *=Stop 
Codon, /=pos3ible nucleotide deletion, 
\=possible nucleotide insertion) 








SQLASTP RAPGS RI1AR WERRBPALRGftKDRLTIGKHliDMGSPQL 
RTRDRGW PS PR P EREKRTS QSAR RPTCTESRWKSEP. EVES DDE Y 
LALPARLTQVSSLVSYLGSISTLVTLPTGDIKGQSPLEVSDSDG 
PASFPSS3SQSQLPPGAALQGSGDPKGQNPCFLRSPVRAHDSAG 
BGSLGSSQALGVSSGLI*KTRPSLPARl,DRWPFSDPDVEGQIiPRK 
GGEQGKHSLVQCVVKTFCXODLEELICMLYI^VADfVTnHGTPAR 
SNI#TSLK\S SLQLYilQPKKDlDEHQSLTBS VLQKQBILI^CXiLE 
NTPVLEDVI^R IAKQSGEIiESHADRLYDS ILASLDMLAGCTLIP 
DKKPMAAMEHPCEGV 


5425 


1086 


115 


GFCPSPSIXJHQPPRVI^P*l^MAVETFGFFMATVGLIJ4IiGVTLP 
NS YWRVSTVHGNVITTin-I FEN1HFSCATDSLGVYNCWKFPSML 
AI^GYIC^CRAI^ITAIIiLGFLGIilMIAGLRCTtWGGLEI^R^ 

aklaatagaph\ ilpgicgmvai \swyafnitr\dfsdplypgt 
kye lgpal ylgwsasli s ilgglclcsacccgsdedp aas arrp 
yqap vsvmpvats dqegds sfgkygrnalrvaalcrgprclpta 
pkkrgpgrgpfpysnlrgrprpvpvapprprprvlhshgpsqak 
ncs wevayl pseagsiii f 


5426 


42 


3435 


ATSSQSLGRADPPRGGTMKRSPGEGPSPSPMDQP5APSDPTDQP 
PAAHAKPD PGSGGQPAG PGAAGEAIiAVLTS FGRRLLV h 3 PVYIA 
GAVGLSVGFVLFGLAIiYLGWRRVRDEKKRSLRAARQLLDDEEQL 
lAOTLYMSHKELPAWVSFPDVEKAF^I^ 

LliAETVAPAVRGSNPHLQTFTFTRVEIiGEKPIJlI IGVKVHPGQR 

KBQI LLDLNI S YVGDVQ I DVEVKKYFCKAGVKGMQLllGVljRVXL 

RPLIGDLPFVGAVSMFFIRRPTl^IKWTGmWUIiDIP^ 

TM IMD3 1 A^FLVLPNRIJjVPLVPDLQDVAQLRS PIjPRGI I R IHL 

LAARGLS SKDKYVKGtt I EG KSDPYALVRLGTQTFCSRV IDE ELN 

PQWGETY^VMVBEVPGQElEVKVFDKDPDKDDFIiGRMKIiDVGKV 

LQAEVIiDDWFPI»0X3GG<^VHLRLEl*I*SLI^ 

VSS RPDPPSAAl LWYLDRAQDLPMVTS EL YP PQLKKGNKEPNP 

MVQLS IQDVTQES KAVYSTNCPVWKEAFRFFLQDPQSQELDVQV 

KDDSRAIiTLGALTLPLARLLTAPBLIIxDQWFQLSSSGPNERLYM 

KLVMRILYLDSSB I CypTVPGCpaAWDVDSENPQRGSSVDAPPR 

POTITPDSQFGTEHVIjRIHVIiF^DLIAKDRITiGGLVKG 

VKLK1»AGRSFRSHVVRKDIjIIPRWNEVFEVIVTSVPG 

DKDLDKDDFI£RCKVRLTTVLNSGFIiDEN^^ 

BRIiTPRPTAABI^EW^VNSblQTQKSAKIJUVAI^IYMBPJ^ 

LPLRKGTKHLS PYATIirVGDSSHKTK7riSOTSAPVWDBSAEPI.I 

RKP1 ITES I .PXQVRGEGTGVI £&t£I»PLS ELLVADQI>CLDRWFTL 

S SGQCQVLLRAQJJ3 1 LVS QKSGVEAH5 HS YSHS SS^ LS HEPEL3 

GGP PH I T3 SAPEV \ RQRI/THVDS PLEAPAGPLGQVKLTliW Y YSB 

ERKLVS I VHGCRSIJUJI^RDPPDP YVSLUJ^PDKNRGTKRRTSQ 

KKRTLS PEFNERFEWE LPLDBAQRRKLDVSVKSNSS FMSR EREI . 

LGKVQliDLiAETDLS QGVARW Y DIjMDNKDKGS S 


5427 


42 


3435 


ATSSQSLGRADPPRGGTMBRSPGEGPSPSPMDQPSAPSDPTDQP 

PAAHAKPDPGSGGQPAGPGAAGEALAVLTSFGRRLLVLIPVYIA 

GAVGLSVGFVLFGLALYI^WRRVRDBKBRSI^RAARCIXDI^EQL 

TAKTLYMSHREDPAWSFPDVEKAEWIjnCI^^ 

LLAETVAPAVRGSOTHLQTFTFTRVEI^EKPLRIIGVK^PGGJR 

KEQILLDLNI SYVGDVQIDVEVKKYFCKAGVKGM QLHGVLR VI L 

Epi»IGDLPFVGAVSMFFT^PTLDIWIt»frtaM 

TMir©SIAAFLVl^PNRiJiVPLVPDI^ 

PQWGBTYEVMVHBVP^EIEVEVPDKDPDKDDFIiGRMK]^ 
LQASVLDDHFPr,<XK^QVTILRr.™LSLL3DAEKLFJQVLQWNVJG 
VSSRPDPPSAAILNnnTLI^OTLPfm'SELYPPOLKKGNKEPNP 
MVQLS IQDVTQES KAVYSTN CP VWEEAFRFFIiQDPQSQEIjDVQV 
KDDSRALTIjGALTLPIiARLLTAPELILDQWFQLSSSGPNSRLYM 
KiVMR ILYLDSSHICFPTVPGCPGAWDVDSENPQRGSSVDAP PR 
PCHTTPDSQFGTRHVLR IHVIJBAQDLIAKDRFLGGLVKGKSDPY 
VKI.KIiAGRSFRSHVVl^LNPRWNEVF 
DKDUDKDDFLGRCKVRLTTV^ 
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corresponding 

to first 
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y^pi-I fin** of 
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Predicted end 
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location 
corresponding 
to first 
amino acid 
residue of 

CUUXXIVJ C-% JLV* 

sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C= Cysteine, D=Aspartic Acid, B- 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L=Leucine, M^Methicnine, N«Asparagine, 
P=Proline, Q=Glut amine, R«Arginine, 
S«=Serine, T»Threonine, V=Valiue, 
W "^Tryptophan, YwTyrosine, X=0nJcnown., *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








BRLTPRPTAAEtEHVI^VNSLIQTQKBABIAAAt^ 

LPLRKGTKHI>SPYATI*TVGDSSRKTKTISQTSAPVWDESASFr»I 

RKPKTBSLELQVRGEGTGVLGSLSfcPtiSBI^^ 

SSGQGQVUiRAQLGII>VSQHSGVEAHSHSYSHSSSSLSBEPELS 

GGPPHITSSAPEV\RQRI>THVDSPL»KAPAGPLGQVKI»TLWYYSE 

BRKLVSIVHGCn^LRQNGRDPPDPYVS^LLLPDKNRGTKRRTSQ 

KKRTLSPBPNE^PEWBLPI^fiAOPJiKLDVSVKSNSS FMSREREL 

DGKVQLDLAETDLSQGVARWYDLMDNKDKG^ 


5428 


3 


1839 


SSP^ERI^CAIJ^PWIiVS5RPARPAQI^RPGKM\^DGA5Bt«BD 
IiVHFSVSBLPSRGYGVMEBIRRQGKUTf\rrLKXGmKPSMiR^ 
LAAS I PYPHAMFTNDMMECKQDE IVMQGMDPSALEALINFAYNG 
NLAIDQQmWiI>lGASFLQW3S iKDACCTPItRBRIiHPKNCLQV 
RQ FAKT^CAVI»YDA ANSFIHQH FVEVSMSEK FIjAL PLEDVLEIi 
VS RDKLNVKS EEQVFEAALAWVRYDRKQRGTPL \RNLQSN I RIjL 

pcrpqfijsdrvqqddlvrcchkcpj>lvdeakdyllmperrphi,p 
afrtrprccts iagliyavcglnsagdslnvvbvpdp iancwer 
crpwtcarsrvgvavvngiikyaiggyt^ 

WTRVGSMNSKRSAMGTVVIjDGQI YVCGGYDGNSSIjSSVBTYSPE 

TDKWTWTSMSSNRSAA\GVTVFEGRiyVSGGHDGLQIFSSVEH 

YNHHTA3WHPAAC3tROCRCRHGftAi>I^ I 

A^YSSV\A1^WCLIVPM\HTRR\SRVSLGGPAVGRI>YAVWGVT 

!X^SI^\SSVGDVLTPETlKrWTFM\APMACHEGGV 

I ! 


5429 


828 


202 


RRBDAI^SSEGCbW PSESTVSGNGI PEPQVYAPPRPTDRUVVP P F 
AQPvERKmFQP^PYDGHEIDLPPTISr^WEKPPPY^PCTIiQ 
LRDPEQQLELNRESVRAPPNRTI FDSDIiMDSARLGGPCPPSSNS 
GISATCYGSGGRMEGPPP\TYSEVIGHYPGSSFQHQQSSGPPSL 
UEGTRtiHHTH IAPLESAAI WS KBKDXQKGHPL 


S430 


441 


1507 


QKRRXRRRKKIMKTIQPKHHNSISWAIPTGLiAALCliFQGVPVRS 

GDATFPKAMI3^fVTVRQGBS ATLRCT I DNRVTR VAWLNRSTI IiYA 

GNDKWCLDPRVVLLSlTrQTQYSIBIQNVDVYDEGPYTCSVQTDN 

HPKTSRVHLrVQVSPKIVEISSDISIMEGNNISLTCIATGRPKP 

TVTWRHISPKAVGPVSBDBYbElQGITREQSGDYKCSASNDVXA 

APV\VRRVKVTVNYPPYISEAKGTGVPVGQKGT^ 

EFQWYKDDKRLI /EGKKGVKVENRPPLSKLIFFNVSRHDYGNYT 

CVASHKW3mmSIMLFGPGAVSEVSKGTSPJUVGCVWIX 

HLLLKF 


5431 


" i" 


1312 


AAAAPGSRRRRPLPDRPHMAHGYEAPPPPAPRSPAWRARSKPV\ 
I»PGITINP \TIAEG PS P\TSEG ASEANLVDLQKKIiEELEIjDEQQ 
KKRLEAFLTXJKAKVGEliKDDDFERISBDGAGNGGVVTKVOTRPS 
GLlMARKLIHIiEIKPAXP^QIIREIiQVIJIKCNSPYIVGFYGAFY 
SDGBIS ICMKHWX^SIJX^KRAKRIPEKIIiGKVSIAVLRGLA 
YLRElfflQIMHRDVHPSNILVNSRGEIKI>CDFGVSGQLIDSMANS 
FVGTOSYMAPF^I^THYSVQSDIWSKGLSLVBLAVGRYPIPPP 
DAKBLEAI FGRPVVDGEBGRPHS ISPRPRPPGRPVSGHGMDSRP 
AMAIFEI^YIVNBPPPKriPNGVFTPDFQBFVNKCLIKNPAERA 
DLKMLTNHTPlKRSEVEBVDFAGyWjCKTLRLNQPGTPTRTAV 


5432 


2 


1312 


AAAAPGSRRRRPLPDRPHMAHGYEAPPPPAPRSPAWRARSKPV\ 
LPGITINP\TIAEGPSP\TSBGASEANLVDLQKKLEELELDBQQ 
KKRLEAFLTQKAKVGELKDDDFERI SEIXIA^ S 
GLIMARKLIHLEIKPArRNQIIREIiQVLHECHSPYIVGFYGAFY 
SDGBISICM^MDGGSLDQVLKEAKRIPEEILGKVSIAVLRGIJV 
YliRBKHQlMHRDVKPSIilLVNSRGEIKLCDFGVSGQLIDSMANS 
FVGTRSYMAPERIX3GTHYSVQSD I WSMGLSLVELAVGRYPI PPP 
DAKELEAI FGRPWDGEEGEPHS ISPRPRPPGRPVSGHGMDSRP 
AMAI FELIjaYXVNEPPPKLPNGVFTPDFQEFVNKCXIKKPABRA 
DLKMLTWTPII^BVEEVDFAGWIXrKTLRLNQPGTPTRTAV 


5433 


360 


1B85 


SVQBDKVGFEDPUILCSWRARACPCTWPHC/ CTGUiECXGFAGV 
LFGWPSLVFVFKNKDYFKDLCXiPDAGPIGmTGQADCKAQDERF 
SLI FTLGS FMNNFMTFPTGYI FDR FKTTVARL I A I FFYTTATLI 
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Amino acid segment containing signal peptide 
<A*cAlanine, O Cysteine, D=»Aspartic Acid, B= 
Glutamic Acid, F*-Phenylalauine, G=<31ycine, 
H^Histidine, I=Isoleucine, K>Lysine, 
L=Leucine, M=Methionine, N=>Asparagine, 
P-Prolitie, Q=Glutamine , R^Axginine, 
S~Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, *=Sbop 
LOuOD, / — possiDJ-Pi nucieoLiae aej.et.xori, 
\=pocsible nucleotide insertion) 








IAPTSAG9AVIiFIiAMPMLTIGGIIiPIiITNIjQIG^FGQHRSTI 
TTLYNGAPDSSSAVFLIIKiaiYEKGISI^/VIJjHI^LCLQYLAC 
STHFTPDAPGAHPXPTAPQLQLWPVPWEVraHKGRBG/QQLSMKrr 
GSYSQIWSFQRRKRPQGQGRSRIISAPSGATL/CSEIRFAWHLVWL 
SVIQLWHYLPlGTLNSLLimAGGDf^VSTyTNAFAFTQFGVL 
CAPWNGIJJTORLKQKYQKEARKTGSS TTiAVALCSTVPSLALTSL 
LCLGFALCASVPILPlXiYIjTF II^VISRSFLYGSNAAPLTIiAfT 
SEKPGKIiFOLVMALSAWSIJ^PPl FTI1»IKGSIjQNDPFYVNVMP 
MLAIIiTFPHPPLVYRBCRTWKESPSAIA 


54 34 


66 




RYAAI»IISI,IQHI^»Rl^C3Rttn:MSP^ 
HGPFIiGC^QYPAO)YVRPi^SADGHlVKVLEGQVCPACGANX>V 
IJlQGRFGHFIGCmOTECKin'BLIDKPDBTAITC^CK^eTCHLVQ 
RRSRYGKTFHS CDRYPECQFA INFKP I AGE CPE CHYPLIj I EKKT 
AQGVKHFCASKQCX3KPVSAB 


543S 


4704 


1597 


PGDSSQRLAEMSNAKERKHAKKMRWQPIWVTLSSGPVADRGVKH 
HSGGEKPFQAQKQBPHPGTSRQRQTRVNPHSLPDPBVNKQSSSK 
GMFRKKGGWKAGPKG^SQEIPKYITASTFAQARAAEISAMLKAV 
TQKSSNETLVFX^LPRHMRRRAMSHNVKR 

VHQKKSHSKNKCHKARR01M^Tr*EFNRRQKKNIWLETHIWIIAK 
RFHMVRXWGYCl/^PTVKSHRACY^^ 

LKGKBEB IIJUy^SGT^IDTQLTFAAVHCl^KRQGSLVLYRVN 
KYPREMI^PVTFIWKSQRTPODPSESRQLWIWliHPTIiKQDII^ 
IKAACQCVEPI KSAVCIADP^PTPSQEKSQTEIiPDEKIGKKRKR 
KDDGEJiDVKPIKKIXGDGTRDPCLP YSW I SPTTOI I ISDI*TMEMN 
RFPJ^IGPLSHSILTEAIKAASVHTTOEOTEfiTPHRWWIETCKKP 
D5TVSIJICRQEAIFEX*LGGITSPAKIPAGTILGLTVGDPRIMLPQ 
KKSKATiPNPEKCX^BKVRQIXLBGVPVECTHSFIWNQDICKSV 
TENKI8DQDLNRMRSELLVPGSQLILGPHESK1 PILLIQQPGKV 
TGEDRIjGWGSGWDVIiIjPKGHGMAFW I P FIYRGVRVGGLjKBSAVH 
SQYKRS PWPGDFPIK:PAGMliFAEKQAKNlJCiEKyKRRPPAKRPN 
YVKLGTLAPFCCPWEQLTODWSSRVQAYEEPSVASSPNGKESDL 
RRS EVPCAPM P KKTHQPSDEVGTS I EHPREAEEVMDAGCQESAG 
PERITIX)EASKNHVAATGSHrX^^Ria,l^l4SAWC^PfiSKPS 
RGGRRAPGRGCX)GLTRBACLS ILGKFPRALVWVSLSLIjSKGSPF. 
PHTMI CVPAKEDFLQLHBDWHYC5GPQES KHSDP FRS Kl LKQKBK 
KKREKRQKP\GRASG»3PAGEBPVAGQEAr,TLGLW3GPI^RVTl4 
HCSRTLLGFVTC^urSMAVG OGEALGr b itilj XJJP1X« iisJFAfty 
R6r»VLLRPPASLQTRFARIAIBV 


5436 


1781 


635 


ASDSlFWSBARTTRKIAOitGCQWStPKRMPtiVVFCGLPYSGKSR 
RABELRVAT^AI^RAVYVVDOAAVLGAEDPAVYGDSAREKAnRC 
AI^SVFJ»LSRHDVVIIJ>SUrciKGFRYB^ 
LVYCVRI>GGPIAGPaVAGANENPGRlfVSVSVmPRAEEDGRAQAA 
GSSVLRE LHTADS VVNGSAG^U^VPKBliEREBSGAABS PALVTPD 
SBKSAKHGSGAFYS PEIiBALTLRFEAPDSRNRWDRPLFTLVGL 

QVlAGXMEAQKSAVPGX^IaLTLPGTTBHLRFTRPLTM 
QPISYTKMHPNNRNIiPQLANMFLQYLSQSLH 


5437 


733 


1672 


<#FJ^EFGGPLH?PAMFIJffiUM^ 

PRRVDSSS ENSGSDWDSAPETMBDVGHPKTKDSGALRVSRAASE 

PSKEEPXJVEQIyGSKRMDSLKWDQPXSSTQESGRLE 

WDHVDSGGTRRPGVSPEGGL\GVPGPGAPI^KPGRREKLIiGWrjR 

GEPGAPSRYLGGPEECXOISTNLTLHIiIiELLASAIJAI^ 

AAtDTLGLRGPLGLWLHGLLS FLAALRGLHAVLSLLTAHPLHFA 

CLPGLLQALVLAVST.REPNGDEAATDWR^EGLRRKGEEQRGDPG 

KGL 


5438 


2443 


1152 


TKPRKRRHQPASQRQRPWSSDSTGDlXARGKGRKiiKNKOSDRVS 

IJ^PSI^P^CQSEARCX5PHLRAAKWr»HFPQUAI^^ 

MSRPALiaRSWPLTVLYiliLPFGAliiPLSRVGTOPVSRVALYKS 

VPTPJjLSRAWGRLNQVKLPHWT.RRPVYSLYIWTFG^MKBAAVE 

UI.HKYRNLSEFFRRKLKPQARPVOGUISVlSPSDGRILNFGaVK 

nceveqvkgvtyslks flgprmcthdlpfp paascds fknqlvt 
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Amino acid segment: containing signal peptide 
(A*=Alanine, C*=Cysteine, INAspar tic Acid, E- 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K>Lysine, 
L=»Leucine, M=Methionine f N=Asparagine , 
P= Proline, Q=Glutamine, R«Arginine, 
S=Serine , T=alireonine, V-Vali»e, 
^Tryptophan, Y=Tyrosine, X ^.Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








fiKGNBLYHCVXYlAPGDYHCraSPTDWTVQHRRHPPaSIiMSVNP 
GMARWIKBLFCHNERVVI.TGDWKHGPFSIiTAVOAT\NWGSIRiy 
FDRDLHIVSPRHSKGSYNDPSFVTHTNREGVPMA^ 
FNLGSTIVLIFKAPKDFNFQLKTGQKrRFGRALGSli 


5439 


2443 


1152 


TKPRKRRHQPASQRQRPWSSDSTGDLLARGKGRJCBKNKGSDRVS 
LAPPS LRR PMMCQSBARQG;PFJJRAAKWIJn^^ 
MSRPALKLRS WPLTVLYYLIjPFGAIiRPLS RVGWRP VSRVALYKS 
VPTRLI^RAWGRLNQVKLPHWIiRRPVYS l yi wtfgvnmkeaave 
DLHHYRNLSE F FRRKLKPQARPVCGLHS VI S PSDGRILNFGQVK 
KCEVEQVKGVTYSLESFLGPRMCTBDLPFPPAAS CDSFKNQLVT 
RKGNELYICTIYIJ^DyHCF>KPTDWTVSHRRHPPGSIMSV^ 
GMARW I KELFCHN3RWLTGDWKHGF PS LTAVGAT\NWGS IRtY 
FDRDLHTNSPRHS KGS YNDFS FVTHTNREGVPWALRGEHLG /QS 
FbnUGSTIVLIFKAPKDPNFQLKTGQKIRFXJKALGSL 


5440 


693 


253 


BPI PVTPDHRI,VTMTHIV\QTFSPVNS\GQPPNYEMI i KEEQEVA 

MW3APHNPAPPMSTVIHIRSBTSVPDHVVW 

FIAFAYS VKS RDRKKVGDVTGAQAYASTAKCLN I W AI^I LG I FMT 

ILLIIIPVLWQAQR 


5441 


2 


2D54 


crdggxngpmvspmkple i ktqcsgprmdpkicpadpafps fin 
nsdlwvanj:etgeerrltfchqglsnvlddpksagvatfviqee 

FDRFTGYWWCPTASWEGSBGLKTIiRILYEEVDESBVEVIHVPSP 

ALEEPJCTDSYRYPRTGSKNPKIALKliAEFGTOSQGKlVSTOBKE 

I,VQPPSSLFPKVEYIARAGWTRDGKYAHAMFIiDRPCX)HIiQLVLL 

FrALir lfi> rJ£WBEQ\RI-«SARAVPKnVQP YVAnfEKVTNVWINVH 

DI FYP FPQSEGEDELCFLRANECKTG FCMLYKVTA VLKSQGYDW 

SBPFSPGF^EQSIiCNArWWEETKLVYFQGTKDTPLEHHLYVVS 

YEAAGEIVRLTTPGFSHSCSMSQNroMFVSHYSSVSTPPCVHVY 

KLSGPDDDPLHKQPRFWASMMEAAKIFHFHTRSDVRLYGMIYKP 

RAI^PGKlCHPTVIiFVYGG PQVQIjVNNS FKG I KYLRLNTIASLG Y 

AVWlIX^GSCQRGLRFBG^IOrQMGQVEI EDQVEGLQFVAEKY 

GFIDL3RVAIHGWSYGGFLSLMGLIHKPQVFKVAXAGAPVTVWM 

AYDTGYTBRYMDVPENNQHGYEAGSVALHV^ 

GFLDENVHFFmWLVSQLIRAGKPYQLQVALPPVSPQIYPNER 

HS IRCPESGEHYBVTIXHFLQEYL 


5442 


1 


3474 


CGQRSRRRS PDMPEAKPAAKKAP KGKDAPKGAPKEAPPKBAPAB 
APKEAPPEBQSPTAEEPTGVFLKKPDSV3VBTGKDAVVVAKVNG 
KRLPDKPT IKHFKGKWLBliSSKSGARPSFKESHNSASHVYTVBL 
HIGKVVLGDRGYYRIiEVKAKDTCDS CGFNIDVEAPRQDASGQSL 








I PPE IWEIJjKGAKKS E YBKI AFQYG I TDLRGMLKRLKKAKVEVK 
KS AAb^KXI*DPA YQVDRGl^I KLMVEISDPDLTIjKWFKZSGQEIK 
PSS KYVFENVGKKR I LT INKCTLADDAAYEVAVKDEKCFTELFV 
KEPPVLIVTPI^EDQQVFVGDRVEMAVEV8EEGAQVMWMKIX5VBL 
TREDSFKARYRFKKDGKRJIILI FS DVVQEDR3RYQ VITNGGQCE 
AEL IXTCEKjQLEVLQDI ADLTVKASEQAVFKCEVS DEKVTGKWYK 
MGVEVRPSKRITXSHVGRFHKLVIDDVRPEDEGDYTPVPDGYAL 
GSLSAKLNFLEIKVE YVPKQ\KPP K I PLGFASGGKTSENAD / IV 
VVAGNKIiRIJ)V\SITGEAPSPFAT:\WLKG\DEVFTTTEGRTRIE 
KRVDCSS FVIESAQREDEGRYTI KVTNPIGBDVAS IPLQWDVP 
DPPEAVRITSVGBDWMLVWEPPMYDGGKpVTGYLVBJUOiKGSQ 
RWMKLNFEVFTSTTYKSTKM IEG I LYEMRVFAVNAIGVSQP SMN 
TKPFMPIAP1^3PIHLI VEJDVTIJTTrTLKWRpPlJR IGAGGIDG Y 
LVBYCI^SEEWVPANTEPVBRCXJ FTVKNLPTGARI LFRWGVN 
IAGRSEPATT^QPVTIREIAEPPKIRLPRHLRQTYIRKVGEQLN 
LVVPFG^K^RPQWWTKGGAPLDTSRVR^TSDFDT\7F^ 
RSDSGEYET^VQIENMKDTATIRIRVVERAGPPINVMVKB 
NALVEWQAPKDIX3NSB IMGY F\^KADIG<TMBW FNVYERNRHTS C 
TVSDLI VGHEYYFRVYTBNI CGLSDSPGV3KNTARILKTGITFK 
PFE YKEHDFRMAP KFLT PL I DRVWAG YSAALNCAVRGHPKP KV 
VWMKNKMBIRFJJPRPLITimjG^ 
N BliGEAIiAB CKLEVRVPQ 
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Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F-Phenyl alanine , G=Glycine, 
H=Histidine, I»lsoleucine, K=Lysine, 
L=Leucine, M=Nethionine, N=Asparagine , 
P=Proline, Q=Glut amine, R-Arginine , 
S^Serine, T=«Threonine, V= Valine, 
H=Tryp t opha n , Y-Tyroaine, X=0nknown, *~Stop 
Codon, /^possible nucleotide deletion, 
\epossible nucleotide insertion) 


5443 


66 


1003 


SRGQJjDAGQSS eqhggnrqpeqs rs rss s s sss prrsrsaaepa 
MALSMPLNGLKEEDKEPLIELFVKAGSDGESIGNCPFSQRLFMI 
LWLKGWFSVTTVDLKRKPADLQNI»BJ^THPPFITPNS bvktdv 
NKJ!EEPI^BVLCPPKYLKLSPKHPESm7VGhn>IFAKFSAYlKNS 
RPBANEM^QtiLKTI^KriDEyr^SPLPDEIDENSMEDIKFSTR 
KFLDGNBMTLADCNLIiPKLHlVl£\^^ 

LTOAYSRDEFTm^PSDKEVEI \AYSDVAKRI^QVKSRLLKEVS 
FMSSP 


5444 


2 


344 


SGPIGVTGAQMAKWLIU>YI^FGGRRPPPQPPTPUYTKSDII^AY 
RRQKNIJ3FBDPY*D8BSRIJgPDPAGPGD£KWPGDAKYGepKHRIi 
I KVEAADMARAKALIjGGPGBELBADTEYLDP FDAQPHPAPPDDG 
YMB P YDAQWVMSBLPGRGVQLYDTP YEEQDPBTADGPPSGQKPR 
QSRMPQ5DERPADHYDQPWBWKKDHI SRAFAVQFDSPjBWBRTPG 
SAKELRRPPPRS PQPAERVDPALPLEKQPWFHGPLNR ADAESLL 
SLCKEGSYLVRI^K-mPQDt^I^LRSSQGFLHLKFARTRKNQVV 
LGQHSGPFPSVPE^VIOTSSRPLPVQGABHIJu^LYPVVTQTP* 
* PD WGDRiU^GQVATGLPELWGAEAPSAAAHPGLHRERHPEt3liP 
RAEKPGLRGPLI^LREPLGAGPRGPHGLQEPRRCQVWFSQAPAH 
QGGGCGYGQSQGPSGRPRGGAGSRH 


5445 


2364 


486 


ILSRGFrX3SVBIClQI»PLPASEPVLIiTWARRRWRETRSRREPT 
TLRAQSVCPWWI*F^PJ4NRSIPVEVDESEPYPSQLLKPIPEYSP 
EBESE PPAPNIRNmPNSLSAPTrCLHNSSGDFSG^STLKCANH 
QRPV3RQVTCLRT(^LEDSEr^FCRRilPGLGICAFPSGCSAVSBP 
ASESVVT3u^AEHQFSPMEKRNQHLVS^^ 

QSLPNASADSLGGSQEMVQRPQPHRMRAGI»DLPTIDTGYDSQPQ 

DVLGIRQLBRPIiPLTSVCYPQDI>PRPLRSRBFPQFEPQRYPACA 

OlfljP PNLSPHAP WMYHYHCPGSPDHQVPYGHD YPRAAYQQVTQP 

ALPGQPIJGASVRGLHPVOKVlliNYPSPWDOEERPAQRDCSFPG 

LPRHQDQPHI IQ P PNR AGAPGESLEC PABliRPQVPQPP SPAA V PR 

PPSNPPARGTI»KTSNLPEELRKVFITYSMDTAMEVVKFVWFI«LV 

NGFQTAID I FEDR TRG IDI I KNMER YLRDKTVMI I VA1SPKYKQ 

DVFX3AESQLDED3HGLHTKYIHRMMQIEFIKX2GSMNFRFIPVLF 

PNAKKEHVPTWI^NTHVYSWPKNKKNIIJ^^ 

LPTLQWPL 


5446 


972 


161 


SSWSWCTGRMRKIIIIjWGIjuLWMIjFVSELRAATKIjTEEKYELKEC^ 

tldvkcdytlekfassqkawqiirdgempktijvcterpsknshp 
vqvgri iledyhdljgllrvrmvniiqvedsglyqcvryqppkeph 
MIJTORIRIWT KGPSGTPGSl^STfflWYKI pptttkalcplyt 
TPRTVTO^PPKSTADVSTPDSEI^TNVTDIIRVPVFNIVILLA 
GGFLSKSLVFSVI»FAVTIiRSFVP*AHEPTRMSSDFQPHPSGSCA 
KGGGRR 


5447 


207 


617 


MTAHTIiS LMASIjVA^DSOSEAETEHAGSFNATGO^KDTSGVAR 

PPGQDFASGTLDVPKAGAQPTKHGS CBDPGG YRLPIAQLGRSDR 

GSCPSQRLQWPGKSPQVWPIKEPSCSSLWTSHVPASHMPLAAA 

RFKQVKLSROTPKSSFHAQSESKTVGKNGSSFQKKKCBDC^ 

TPRFJ J RQRQiU,STF^KGKb^PO^PPAGPJ\PAPLYVGPGV 

IQPYI^SHYKETWPRXVLFHLRGlffiGPWriQWC^VLSKSHML 

LSTSMDKTFKWNAVDSGHCliO/IYSLHTEA^ 

S^FDFAXiHLTDLETGTQLFSGRSDFRITTLKFHPKDHNIFLCG 

GPSSKrtKAWDIRTGKVMRSYKATIQCrrXiDILFI*RF^SEFI»SSTO 

ASTRDSADRTI I AWDFRTSAKI SfcJQ I FHERFTC PS ^ALHP R K PV 

FLAQ rW^N I JjAXjrSTVWPYRMSRRRRYEGHKvfiGYS^ 

GDIAVTGSAIXJRVI^SFRTASRACTI^HTQAC^GTTYHPyaP 

SVLATCSWGGDMKIW1£*AFHWLSIjGEAIGDLAPARGYSGPGRSL 

KSPSPSKSIiIiVIiLCGRAMFQPATCPWQLPALSK 


5448 


194 


1833 


MASKVTDAIVWYQKK!Gi*YDO£ 

HVKPDL XDVDLVRGSAFAKAKPESPT^S LTTKG IVRWFFP F FF 
RWWI^VTSKVIFFWLLVLYLLQVAAIVIjFCSTSS P1IS1 PLTEVI 
GPIWUOiU/nTOOQIVSTRTPKPPl^T^^ 
mBGDGSSTTDNTQEGAVQNHGTSTSHSVGTVFPJ>LWHAAFTI»S 


GSKKAKNS IDKS TETDNG YVSIjDGKKTVKSGEDG IQNHBPQCBT 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B» 
Glutamic Acid, ^Phenylalanine , G*=Glycine, 
H=His t idine , I=»Isoleucine , K=»Lysine , 
L^Leucine, M=MetIiionine, N=Asparagine , 
P=Proline f Q«Glutamine, R=Arginine f 
S^Serine, T^= Threonine , V=Valine, 
W^Tryptophan, Y^Tyrosine, X~Unknovn, *=Stop 
Codon, /=po3sible nucleotide deletion, 
\=possible nucleotide insertion) 








IRPEETAWNTCTLRNGPS KDTQRTI TNVSDBVSSBjBGPBTG YSL 
RRHVDRTSEGVl,RNRKSHHYKKHYPrJE3APKSGTSCSSRC5SSR 
QDS ESARPES ETBDVLWEDLLHCAECHS SCTS ETDVENHQINPC 
VTCKEYRDDPPHQSHIjPWIiHSSHPGLKKISArVWEGNDCKKADMS 
VLBISGMIMNRVNSHI PGlGYQIFGNAVSLILGLTPPVFPOiSQA 
TDLBQLTAHSAS KLYVIAPGSNBDVI VIiSMVI I SFVVRVSLVWI 
PFFIiCVAERTYKQVGIM+TSEGVLHNRKSHHYX^ 
SGTSCSSRCSSSRQDSESARPKSETKDVLWEDLLHCABCHSSCrr 
5ETDVENHQINP CVKKE YRDDPFHQSIILrPWIJiS SHPGUSKISAI 
VWEXJNDCKKADMSVIjBISGMIMNRVNSHIPGIGYQIFGNAVSIJ: 

LGIjTP pvfrlsqatdleqltahsasel y VI afgsnetw I vi^smv 

I ISFWRVSLVWI FFFLLCVABRTYKQVGIM 


5449 


194 


1833 


MASKVTDAI VH YQKKIGAYDQQI WBKSVBQREI KGLRNKPK^TO~ 
HVKPDL I DVDLWG S APAJCAXPBS PWTSLTTKG IVRWPPPFPF 
RHWr^VTS KVI FFKLLVLYL^QVAAI VLPCSTSSPHSIPLTEVI 
GPll^IJaXSTVHCQ^STRTPKPPLSTGGK^ 
HREGIX3SSTTDNTQEGAVQ1WGTSTSHSVGTVFRD1jWHAAFF^ 
GSKKAKNSIDKSTETDtK3YVSIJX3KKTVECSGEDGIQNHEPQC5T 
I RPEETAWNTGTLRNGP.S KDTQRT I TNVSDBVS S EEGPETG YS h 

QDSBSARPBSETEDVLWEDI.LHCAKCHSSCTSBTDVKNHQINPC 
VKKEYRDDPFHQSHLP6JLR5SHPGLBKISAIVWEX5NDCKKADMS 
VLKISGMIMNRVSSHIPGIGYQIFGNAVSLILGLTPFVFRLSQA 
TDLEQLTAHfiAS ELYVIAFGSNEDVI VLSMVI I S FWRVSLVWI 
PPFIiCYAERTYlCQVGIM*TSEGVIJUIRKSHHYKKHYP^ 
SGT3CSS RCSSSRQOSRSARPESETKD VLWEDL AHCAECHSSCT 
SBTDVBNHQINPCVKKKYRDDPFHQSHLPWLHSSHPGIiEKISAI 
VWEGNDOCKADMSVLKISGMIMNRWSHIPGIGYQIPGNAVSLI 
LGI/TP FVFRLS QATDLKQLTAHSAS KLYVXAFGSNS DVTVLSMV 
I ISFWRVSriVWIFFPLLCVAERTYKQVGIM 


5450 


B136 


1242 


GQQFAS PFG*raPEOTVAMALTDIDI^LQFSMSQPEAl>IiI>LAAG 
PADHLI.I^LYGGIll^VRLVLGQREI^I^TPAFnil^MIPHTW 
LTVVKGWATIjSVDGFIiMASSAVPGAPI*BVPYGLF 
YLRGTSRPU^CIJIAATI^RSrjI^PLTPDVHEGGABEFSASDD 
V ALG F SG P H S LAAP PANG TQD KG TLE FTTjTTQS RQA P LAFQ AGG 
RRGDP I Y VD I FEGHLRAWBRGG^5TVLLHNSVPVAI>GQPHEVSV 
HINAHRLE I S VDQ YPTHTSNRGVLSHf LKPRG^IJ^LGGIJ^ABASR 
HIiQKHRLGLTPKATNASIJX3CMEDI£WGQRRGI^ 
AGCRLKEBEYEDDAYGHYEAFSTTiAPBAW PAMEL PE PCVPEPGL 
PPVFANFlXJIiLT IS PL WABGGTAWLB WTRHVQPTIiDIjMEAEIiR X 
SQVLPSVTRGAHYGEIjEIiDI LGAQARKMFTI^^ 
GSKOTSIX)LVI^SVTARVPMPSCI^RGOTYLLP IQVNPVNDP? 
H 1 1 FPHGSIjMVI LEHTQKPIiGPEVPGAYD PDSACEGLTFOVLGT 
SSGLP VERRDQPGEPATEFS CRELEAGSIiVYVHOGG PAQDLTFR 
VS DGLQ A3 P P ATL K V VAJC RPAIQIHRS TG LR I AQG3 AM P I L PAN 
LSVETNAVGOJDVSVLFRVTCA^ 

AFHQRDVE QGRVRYLSTDPQHHAYPTVETrtiALEVQVGQE ILSNI, 
SPPVTIQRATVWMldtfiEPIjrrQNTC^ 

PPTFHYEVVQAPRKGNLQIiOGTRI»SDGO^FTQDDIQAGRVTO 
TARAS EAVEDTFRFRVTAPPY FS PI.YT PP IHIGGDPDAPVLTNV 

llvvpeggegvlsadhlfvkslnsasyi,yevmp^prix;riahrg 

TQDKTTMVTS PTNEDLLRGRLVYQHDDSETTEDDI PFVATRQGK 

SSGDMAWEEVRGVFRVAIQPVNDHAPVQT I SRI FHVARGGRRLL 

TTDDVAFSDADSGPADAQLVLTRKDIiFGS I VAVDEPTRPIYRF 

TORDU«^VIiFVHSGAI>RGWIQI^VSI>GQH{^ 

YLRVANGS SLVVPO<3GOGTIDTAVTif ILI>TNLJ) 1 RSGDEVHYHVT 

AGPRWGQLVRAGQPATAFSQ QDLLDGAVLYSHNGS LS PEDTMAF 

SVEAGPVKTDATLQVTIAljEGPkAPI^k^ 

PJ^DQI^AAQEAVPPADIVFSVKSPPSAGYXVMVSRGAIADEPPS 

I^pVO^PSQEAViyrcRVLYIJISRPEAWSnAFSLDVASGLGAPLB 

GVLVEUSVLPAAI PIJ3AQNFSVPEGGSI»TLAPPLI»RVSGPYFPT 
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amino acid 
residue of 
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Amino acid segment containing signal peptide 
(A=Alanine, C=*Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, GaGlycine, 
H=Histidine, I-Iooleucine, K»Lysine, 
L=Leucine, M»Methionine , N^Asparagine, 
p-Proline, Q=Glut amine, R*-Arginine, 
S^Serine, T= Threonine, V=Valine. 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








UXJLSLQVLEPPQHGPLQKEDGPQAirri^AFSVmMVBEQLlRYV 

HDGSBTLTDS PVIJMANASEMDRQSHPVAFTVTVLPVNDQPP 1 LT 

TNTGLQMWEGATAPI PAEALRSTDGDSGSBDLVYTIEQPSNGRV 

VLRGAPCTEWSFTQAQIJX^LVLFSHRGTLIX^FPFR 

TS PGHFTRVTAQKQVLLSLKGSQTLTVCPGSVQPLSSQTLRASS 

SAGTDPQIJjLYRVWGPQLGRLFHAQQDSTGEALVNPTQAEVYA 

GU ILYEHBMPPEPFWEAHDTLELQLSS PPARDVAATLAVAVSPB 

AA<^RPSHLWKNKGLWVPEGQRAJRITVAAI^ 

RSEHDVLFQ^n^FPSRGQliIiVSEEPIiHAGQPHFIiQSQLAAGQLV 

YAHGGGGT0X)DGFHPRAHIiQGPAGA3VAGPQTSEAFAITVRI>VN 

BRPPQPOASVPLRIiTRGSRAPISRAOLSVVDPDSAPGBIBYKVQ 

RAPHNGFLSLVGGGIiGP VTRFTQADVDSGRLAFVAI^SSVAGI F 

QLSMSDGAS P PLPMSIiAVD I LPSAI EVQLRAPLEVPQALGRSSL 

SQOOLRWSDREEPKAAYRIilQGPOYGHLLVGGRPISAFSQFQI 

DQGIftn/FAFTNFSSSHDHFRVLAIJu^GVNA 

WAGGPWPQGATLrO^PTVLDAGELANRTCSVPRFRLLEGPRIIOR 

VVRVPRARTEPGGSQLVEQPTGX?DLfUX5RLGIiEVGRPEGRAPGP 

AGDSIiTLELWAQGVPPAVASLDFATEPYNAARPYSVALLSVPEA 

ARTEAGKPESSTPTGEPGPMASSPEPAVAKGGFI>S FLEAWMFSV 

I IPMCLWLrJLiLAIilLPLLFYLRKRNKTGKHDVQVLTA^ 

GOTF/TFRKVEPGQAJPLTAVPGQGPPPGGQPDPEtJjQFCRTPNP 

ALKNGOYWV 


54 SI 


l 


2274 


RDSSEWRTGDTI^RPSACI^AIiKPPCLWRWHERGKKDRDS 

KNSEPGSpHSLFJUjRDAAPSQGLNFIfLLPTKMLFIFNFLFSPIiP 

TPALI CILTFGAAI FLWLITRPQPVLPIiLDLNNQS VGIEGGARK 

GVSQKNNDLTS CCFSDAKTMYEVFQRGIAVSDNGPCLGYRKPNQ 

PYRW LS YKQVSDRAE YLGSCLLiHKG YKS SPDQFVGIFAQNRPEW 

IISELACYTYSMVAVPLYDTIiGPEAIVHIVNKADIAMVICDTPQ 

KALVLlGNVEKGPTPSLKVirLMDPFDDDLKQRGEKSGIEILSL 

YnAENLGKEHFRKPVPPSPEDI^VlCFTSGTTGDPXGAMITHQN 

IVSNAAAFIJ(CVEHAYEPTPDDVAISY1.PLAHMF^RIVG^WVY3 

C«ARVGFFX)GDIRLI.ADDMKTLKPTL^ 

AKTPLKKFIiLKlAVSSKFKEJk^ 

GGR^VlVTGAAPMSTSVMTFFRAAMGCQVYEAYGQrECTGGCT 

FTLPGDHTSGHVGVPLACNYVKI^DVADMNYFTV^^ 

1 WFKG YLKDPEKTQEALDSJX5WLHTGD IGRWIiPWGTLKI IDRK 

KNIFKIJ^ErYX^EKIEKIYimSQPVLQIFVHGESLRSSLVGV 

VVPl)TDVLPSFAAKLGVHGSFEEJiC^^ 

SGLKTFEQVRAIFLHPEPFS IENGLLTPTLKAKRGEI»SKYFRTQ 
IDSLYEHIQD 


S452 


1833 


1138 


SRVPSLC^I^J^LSPSREPVAG^PGCGTAGPPAMATLKGGLIiR 
LGSLI*SI>S CLALSVLLLAQIiSDAAKNFEPVRCKC I CPP YKEfJ SG 
HIYNKJNISQKDCDCliHVVBPMP\n^PDVEAYCIiRCECKYEERSS 
VTIKVTI IIYLSII^IaIJJ,YMVYLTL^ 

DDDIGDHQPFANAHDVLARSRSRANVLNKVEYAG^RHKLQVQEQ 
RKSVFDRHWLS 


5453 


111 


1520 


PSIPAAVPQSAPPEPHREBTVTATATSQVAQQPPAAAAPGBQAV 

AGPAPSTVPSSTSKDRPVSQPSIiVGSKBKPPPARSGSGGGSAKE 

PQEBRSQQQDD I EEIiETKAVGMSNDGR FLKFDI E IGRGSFKTVY 

KGLETETTVEVAWCBI^DRKLTKSFJiQRFKEEAEMLKGLQHPNI 

VRFYDSWESTVKGKKCIVLVTELMTSGTIJ^ 

RS WCRQI L KGLQFTiHTRTPP I IBRDXjKCDHIFITGPTOSVKIGD 

I^LATI^RASFAKSVIGTPEFMAPBMYEEKYDBSVDVYAK^ 

IiEMATSEYPYSECQKAAQIYRRVTSGVKPASFDKVAIPEVKEII 

EGCIRQNKDER YS I KDLLWHAFFQEETGVRVELtAEEDDGEK I AI 

KLWLR IED IKKLKGKYKDNEAI EFS FDLERNVPRDVAQEMVE SG 

YVCEGDHKTMAKAI KDRVSLI KRKREQRQL* 


5454 


111 


1520 


PSIPAAVPOSAPPBPHRBBTVTATATSG^G^PPAAAAPGEOAV 
AG PAPSTVPS STSKDRP VSQPS I»VGS KEEP PPARSGSGGGSAKE 
PQKERSQQQDDIEBIiETKAVGMSNDGRPliKFDlEIGRGSFKTVY 
KGIjDTETTVEVAWCn&iQORKLT]^ 
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amino acid 
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amino acid 
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Predicted end 
nucleotide 
location 
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to first 
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amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C- Cysteine , D=Aspartic Acid, E= 
Glutamic Acid, F^Phenyl alanine, G=Glycine, 
H=Histidine, Idsoleucine, K=Lysine, 
L*>Leucine, M^Methionine, N*Asparagine, 
P^Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine f V=Valine, 
W=*Tryptophan, Y=»Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\s»possible nucleotide insertion) 








WFYDS^BSl^KGKKCIVLVTEL^SGTLKTYLKRFKVMKIKVI, 
RSWCRQILKGI^FLHTRTPPIlHRDIiKCDNIPITGPTGSVKIGD 
LGLATLKRASFAKSVIGTPEFMAPEMYKRKYDBSVDVYAPCaviCM 
LEMATSEYP YS ECQNAAQ 1 YRRVTSG VKPAS FDKVA1 PEVKEI X 
EGCIRQNKDERYS I lOJLLNHAPFQEBTGVRVELAEBDDGBKIAI 
KLWIxRIKDI K1CIJCGKYKI»NEAIEPSFT>IJBRNVPEDVA0EMVKSG 
YVCEGDHKTMAKAIKDRVS JUIKRKREQRQL * 


5455 


1359 


377 


LTMVS PATRKSLPKVKAMD F1TSTAI LPIjLFGCIi3VFGljPRr xlJQ 
WTOGKAYliRNAVWI TGATSGIX3 KECAKVF5TAAG 
GALESLIP^LTASHATKVO/rHKPYLVTFDLTDSGAIVAAAABIL 
- QCFGYVDI LVNNAG I S YRGTIMITrTViyTOKRVMETNYFGP VALT 
KAUjpSMIKRRQGHlVAISSIQGKMS IPFRSAYAASKHATQAFF 
DCLRAKMEQYKIF.VTVI S PGYIHTNLSVNA1TADGSRYGVMDTT 
TAQGRS PVEVAQDVLAAVGXKKKDVT rADLLPSLAVYLRTTAPG 
LFFSLMASRARKERKSKNS 


5456 


2 


2332 


OGAGL VAAGAVLVLY PASRAGBRTRV?GS PAPSS LPLHS PGACG 

TEVDMDPQRSPLIOTKGNIBLKRPLIXAPSQLPI^SRLKKRPD 

QMBDGl^PBKKRTRGI^TTKITrSHpRVPSLTTVPOrC^ 

QKVSKKTCPR CSTAIATGLXI^KPVPAVPVQKS^ 

KKPSKRPAWDIJCGOJjCTIJaAELKRCIffiRTQ^ 

BAQO^VKAIfiTERTTLBGaLAKVQAQABO^ 

EBRLSTQEGI#VQKIjQKKQVEIjQEERRGIjMSQIjEE KERRLQTS ea 

ALS S SQAEVAS I*RQETVAQAAIX^ BREERLHGLEMERRRIiHNQIj 

QELKGNTRVFCRVRPVLPGEPTPPPGIjIjLFPSGPGGPSDPPTRIj 

SLSRSDERRGTIjSGAPAPPTRHDFSFDRVFPPGSGQDRVFESIA 

MIiVQSALDGYPVCIFAYGQTGSGKTPTMBGGPGGDPCJIiEGLI PR 

ALRH1iFSVAQEI>SGQGWTYS FVASYVEIYKBTVRDLLATGTRKG 

QGGECRlRRAGPGSEELTVTflARYVWSCEKEVllAlJjHLAR 

AVARTAQ^RSSR5HSVFQLQISGEHSSRGIX)CGAPLSLVDI»AG 

SERIJJPGLAIXSPGEREmdlETQAJNSSI^T^ 

VPYRNSKLTYXLQNSIXXJSAKWLM^ 

ASKVBPSVLFGTAQSlHUCmCTDPDIjCVCVCV^ 

MSMYRVRGGRVAGGCFXGWRAPCPRAIK 


5457 


2 


1540 


EOFVERRRWT-RTl'CbVRS PPHVPVCX3HACSWNGGSLDPLKGTPA 
LLRS AERI>5RKVKKIiRIiDKENTGSWR S F$I .ftSEGAERMATTGTP 
TADRGI1AAATDDPAARFQVQKUSWDGLRSI 1HGSRKYSGLI VNK 
APHDFQFVQKTDESGPHSHRLYYLGMPYGSREMSLiYSBIPKKV 
RKKALIJJ>SWKCWLDIIFOATPiniGVYSRBKKLLRERKRI J GVFGr 
TSYDFH^ESGI^FC^NSLFHCmKK?raraFWSPGPGCVSPMK 
PJ^IJmjCSGPR^PFCICPADPAFFSFINNSDLWVANIETGEER 
IU,TFCHO^I^SimjDDPKSAGVATFVIQEEFDRFTGY 
EGSEGIJCTriRIIiYEEVDESEVEVlHVPSPALEERKTDSYRYPRT 
GSKKPKIALKlAEFCm^O^KIVSTQRKRLVQPFSSI^KVKYI 
ARAGWTRDGKYAWAMPLDRPOX2WIJQLVLLPPALFI PSTENEEQA 
AShCQS CPQECPAVCG VRGGHQRLDQCS 


5458 


6642 


4022 


FVPGLREPQWE PAQPS ATMSAPSBEEE YARIjVMEAQP EWLRAEV 
KRLSHELAETTREKIQAAEYGIAVIiEBKHQLKLQ 
IRSEMEQLKE^FGQAHTNHIOCVAADGRSREESLI QESASKEQYY 
VRKVLBLQTEIjKjQLRNVLT^ 

QRGRLRDD IKEYKFREARIXQD YS KIiRRENI SliQKQVSVLRQNQ 
VEPEGIJCHEIKHLEEETEYIiNSQIxEElAIRIJCEIS KRQLBBAIiET 

T.TTIT^I^iriJQTiT3irPTJCttVMQTTJT\CWV , PQWTJJ\roT TVT inroTMVUn 
CAO\£JU*0£JiUUMj&Il * nil JJSLtOl? X 1 ZitULitl V SlJlAjlJllJfaUUAA 

EPNPTOAEALVNGFEHGGLAKLPIJDNKTSTPKKEGLAPPSPSLVS 

DI^BLNlSBIQKLKOXillWMERiKAGXIATLQDTQK^ 

SLSEQQEKVTRLTPlNIjS AL.R RLQAS KE RQTALDNE KDRDSHEDG 

DYYEVDIWGPEIIACXYHVAVAEAGELREQLKALRSTHEAREA^ 

HABE3CGRYEAEGQALTBKVSLLEKASRQI>J^^ 

DVAGETQGSLSVAQDEI,VTPsEBLANLYHHVCKaraETPmmL 

DYYREGQGGAGRTSPGGRTS PEARGRRSP I LLPKGMAPBAGRA 

IX^TGDSSPSPGSSLPSPI^DPRREPWJIYNLIAIIRIXJIKHLQ 

AAVDRTTBIiSRQRlASQEI/5PAVDKDKEALMEEILKI*IC3I«^ 
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Jrj.eou.cc. eel 
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to first 
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amino acid 
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nucleotide 
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corresponding 

CO I. list 

amino acid 
residue of 
amino acid 
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Amino acid segment containing 3ignai peptide 
{A^Alanine, C=Cysteine, D=Aspartic Acid, E~ 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I-lsoleucine, K=Lysine, 
u-beucine , M-Mouiixonine, rj^Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, ^Threonine, V~Valine, 
W=Tryptophan, Y=Tyrosine, X "Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








REQITTj^WIiKANKQTAEVAIJW^ 
NELKJ\LKEnAATFSSl»RAMFATRCD^ 

VIT*PT MOT T Tl Hd fv Taavt X T m/tnT t>t t r if nrTD/vnnnnm **» v.fr MT * 

KJv. 1 lji>iSIjbKMAJ.QQKJjALTyRLE 
PATPSVSHTCACaS DRAKGTGLANQVFCS EKHS XYCD 


5459 


316 


1262 


RGGHRLSGMASNFND 1VKQGYVRI RSRRLGI YQRCWLVFKKAS S 

KGPKRLEKFSDERAAYFRCYHKVTELMtfV^ 

GIYFNDDTS XT FACES DLEADEWCKVI^^ 

DLIATGVBREOJ»BREjWYTiMPS^ 

DVQNPRVKLISWI^AlJiRYGRDTl^ 

QTRTCBAIYQiWHSAAIJUAKOJHBRIiLQSVKNSMLQ^ 

SLSTMVPXjPRSAYW^ITRQHSTGQLYRI^UVSSPLKLHRTETF 

PAYRSBH 


5460 


45 


2097 


RPGCRAGEMTGSRARERVRWlVSAPOGODSRRCDPKVIsRGRSP 

GLGLAEMPSC^ACTCGAAAVPXITSSLASA^ 

GRI^TFETQ1LQI^PLRSFTETPAYPASKI)GISKIX3SGIX5NKKS 

ASBGSSKKSGSGNSGKGGKQIJlCPKCGDiXTHVET^ 

CEKCRHPFVVLSEAllSKICSIIKEPESAAB^ 

KJ YNYI^KYWGOSFAKKVIaSVAVYNHYKR I YNNIPANIiRQQAB 

VBKQT3LTPREl,EIRRREDE^RFTIO^IAGISP«GNALGASMQ 

OQVNQQIPQEKRGGEVLDSSHDDI KLEKSNIfcLLGP'rGSGKTIJj 

AQTIiAKCjjD VP FAI CDCTTLTQAG YVGEDI ESV IAKLLQDtAtfYN 

VBKAOXX5XVF1^EVDKIGSVPGIHQIjM3VGG 

TIVNVPEKWSRKI^RGBTVQVDTTNILF^ 

NKKYLGFGTPSNLGKGRRAAAAADIJVfreSGBSNTlIQDIEEKDRL 
IJUIVEARDL1EPGM1PEPVGRLPVWPLIISIJ>EKTLVQILTEPR 
NAVI PQYG^FSHDKCEIdmT2DAl.KAXARlA^ 
IMBKXIJiKPMFEVPNSDlVCVEVDKWVEGKKEPGYIRAPTKES 
SEBEYDSGVEEEGWPRQADAANS 


5461 


1481 


160 


INPPPPPKSPCGPJ\FJCWRRPJ*RPGAPEAAVM 
SHR&PGDCFI^VI^YAPVGF^b^ 

S VLRR FWRTM CAVLGLVARQEDSGLRDHS VRVL I SNUVTPFDH 
NXVNLLTTCSTPIJ^SPPSFVCWSRGPME>INGRGBt J VESLKRFC 
ASTRLPPTPLLIiFPEEEATNGREGLLRFSSWPFS iqdwqpltl 
QVQRPLVSVTVSimSWVSEI^HSLFVPFTVYQVRWLRPVHRQIjG 
EANEEFAIiRVTOLVAKBLGO/TCTROT^ 

Q5AQSS PPttBqVSPDVQIJLTIJ^RVKKVLPHVPlJ3VI<^LW 
TGCVBliTITNLI^GAVAFl^EDITKGt^LPTASASj^PSSGPV 
TPQPTALTFAKSSI^QESl^ERKO^VWARRRPTKimAQEAD 


5462 


663 


3353 


KIKERQMSANKSPPSAQKSVLPTAIPAVl^AASPCSSPKTObSA 

RLSNGSFSAPSLTNSRGSVinTVSPlJ^IGLTRESVTIEAQELSL 

SAVKD1 VCS IVYQKFPECX5FFGMYDK^ 

SADE IHEGDLVEVVLSALATVBJJFOIRPHTL^ 

CGmLmhVRQGliKCBGCGUfYnKRCAFKTPmCSGVR 

VSLPGPGI>SVPPJ?IXJPEYVALPSEESHVHQEPSKRIPSWSGRPI 

WMEKMVMCSVKVPHTFAVHS YTRPT I CQ YCKRLI>KGLPRQGMQC 

roCKFNCHKRCASXVPRDCLGBVTFNGEPSSIi^ 

NDINSDSSRGLDDTEBPSPPEDKMFFLDPSDLDVERDEBAVKTI 

RHYTOIJJSKCLTIiFQNBSGSKYYKEIPLSEILRISSPRDFTNIS 
QGSNPHCFEIITIrlWVYFV^ 

WKKAIRQALMPVTPOASVCTS PGOGKDHKDLSTS ISVSMCQI QE 

IWDISTVYQIFADEVLGSGQFGIVYGGK^ 

RFPTKQRSQLRNEVAIIiQNLmiPGrV^^ 

LHGDf^EMILSSBKSRIiPERITKFMVTQILVALRl^IFKNIVHC 

DLKPENVLIiAS ABP F PQVKLCD FGFAR I IGEKSFRRSWGTPAY 

LAPEVWcSREYiStj^Ii^^ 

QNAAFM YPPNPWRE I SGKA1 DIj IIWLLQ VKMRKR Y$ VDKSLSH P 
W^DYG/TVMI^EFETRIGER Y ITHESDDAR^IHAYTHNLVYP 
KHPIMAPNPDDMEBDP 


5463 


237 


1012 


LLSVimTSRCSHl^EVi^DCTS^ 

Y VMOVSAKDGQliLSTVVRTIiATQS PFNDRPMCRI CHEGS SQEDL 
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amino acid 
residue of 
amino acid 
sequence 


(A^Alanine, C= Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F^Phenylalanine, G-Glyeine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L= Leucine , M=Methionine, N=Asparagine , 
P= Proline, Q -Glut amine, R^Arginine, 
S=Serine. T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X«Unknown, *-5top 
Codon, /^possible nucleotide deletion, 
\apossible nucleotide insertion) 








LSPCBCTCTLGTIHRSCLBHWLSSSNTSYCKLCHPRyAVBHKPR 
PLVEWLRNPGPQHE KRTLFGDMVCFXFITPLATISGWLCLRGAV 
DHK4FSSRLEAVGLIALTVALFT I YXFWTLVS FRYHCRLYNEWR 
RTNQRVIbLIPKSVNVPSNOPSLLGtHSVKRNSKETVV 


5464 




677 


S PSMNPRKKVDLKIj 1 1 VGA10VGKTSLLHQ YVHKTF YEE YQTTL 
GASILSKIIIIiC3DTTLKIX}IWDTGQQBRVRSMVSW 
LAFDVTDLESPEAJ^IWRGDVIiAKIVPMEQSYPMVLLGNKTDIiA 
DRK YQS I LENHLTES IKLSPDQSRSRCC 


5465 


5278 


3348 


KGDPREFIRVHREALECDYVSAHLHEW IDLIFGYKQQGPAAVBA 
VKTVFHHLPYEGQVDI YNINDPLKETATIGFINNPGQI PKQLFKK 
PHPPiatVRSRI^DNAGISVLPGSTSDKIFFlIHIiDNl.RPSri'XPV 
KELKEPVGQIVCTDKGlIiAVEQNKVLI PPTHHKTPAHGYADLS C 
RLGTYBSDKAMTVYECLSEWGQILCAICPNPKLVITGGTSTVVC 
VW ISMt* AoKK IU\KT V TI>KQ AJL WjHTU 1 vl CATAS LiAY H J. I VSGSR 
DRTC 1 1 WDLNKIjSFLTQI>RGHRAPV^7\I^ JNKLTGDI VS CAGT Y 
IHVWS INGNPI VS VNTFTGRSQQI I CCCMSEMNEWDTQNVI VTG 
HSDGWTOFHRMEFI^VPBTPAPEPAEVLBMQKDCPEAQIGQEAQ 
DEDSSDSBADEQSISQDPKDTPSQP3STSHRPRAASCRATAAKC 
TDSG3DDSRRNSDQLSLDBKDGFIFVNYSEGQTRAHLQGPLSHP 
HPNP I EVRNYSRJLKPGYRWERQLVFRSKiTMHTAFDRKDNAHPA 
EVTALGISKDHSRXLVGDSRGRVFSWSVSDQPGRSAADHWVKDR 

S3 PVR VCQNCYYNLQH ERGSEDG PRNC 


5466 


3 


992 


HACAHASAHASGRLVRWWRKRRSVMGIQTSP^ 

LGIAVGSYLVRRSRRPQVTLI^PNEKYLLRI^KTTVSHNTKRF 

RFALPTAHHTLGLPVGKHIYXSTRIDGSLVIRPYTPVTSDEDQG 

YVDLVIKVYLKGVHPKFPBGGKMSQYLDSLKVGDWEFRGPSGL 

LTYTGKGHFMIQPNKKSPPBPRVAKKLGMIAGGTGITPMLQLIR 

AlLKVPEDPTQCFIiLFANQTEKDI ILiREDLEE LQARYPNRFKLW 

FTIiDIIPPKDWAYSKGFVTAiDMIREHLPAPGDDVLVLLCGPPPMV 

QLACHPNLDKLGYSQKNRFTY 


5467 


2103 


4 


GEA^VGTRGCRRDLPDPO^IFIQKKDLEEDESVTAAIILKSRG 
RSPRK I DQPCNS SNMVHGSVTFRDVAI DPS QBE WBCLQPDQRTL 
YPJ^VMLE^YSIfLISIiAGSSISKPDVITLI^OEKEpWMVVRKETS 
RRYPDLBLKYGPBKVS PEWDTSBVNIjPKQVI KQ I STTLGIEAFY 
FRNDSEYROFKGLQGYQEGNINQKM I S YEKliPTHTPHASLICNT 
HKPYECKBCGKYFSCGSNLIQHQS IHTGEKPYKCKSCGKAFQLH 
I QLTRHQKFHTGBKXFB CKRCGKAFN1>PTQLNRHKN IHXVKKLF 
BCKBCGKSFNRSSNLTQHQS IHAGVKP YQCKECGKAB'NRGSNIiI 
QHQKIHSNBKPFVCKBOGMAFR YHYQI. I EHOQ I HTGEKPFBCKE 
CGXAFTI^TKLVRHQKIRTGBKPFBCREOTCAFSLI^ 
IHlGEKPFECKECXlKSF^RSSNLVOJiQSIE^IKPYECKKCGKG 
FI^AHI»IOHQKIHSJ!ffiKPFVCKECBMAFRYHCQl> I EHSR I HTG 
DKPFECQDOGKAFNRGSSLVQHQS IHTGKKPYBCKECGKAFRLY 
L<JI^QHQKTHTGEKPFECKECX5KFFRRGSNIiNQHRS IHTGKKPF 
ECKBCXyKAFRLHMKT.IRHQXLHTGEKPFB 
RHQKLHTGBKPFBCKE<X3KVFSLPTQ 


5468 


225 


2976 


SFLTDLFQSLAQLENIiCKQLYET^DTTTRLQAEKi^ PD 
CLSKCQLUjERGSS S YSQLLAAXXILTKLVSRTNKPLPIjEQRID I 
RNYVXJSTYLATRTKIATFVTQALIQLYARITX 
RNAITDVTRFIQDSVEYCI I GVTI LS QI/TN EINQVS ATAFL I EA 
DTTHPLTKHRKI AS S FRDS SL FDI FTLS CNLLKQASGKNLNLND 
JSS(^GLLMQI^KLTHNCLNFDFIGTS TDESSDDLCTVQIPTS WR 
&AFLDSSTLQLSTIGRCBYKKTCALLVQI.FD0^AQSYQBLLQ3A 
SASPMDlAVQEGRLTWIiVYI IGAVTGGRVSFASTDEQDAMDGEL 

KSSKLYRJOiSEVLGLNDETMVLSVFIGKI ITNLKYWGRCBPITS 
KTI^IJTOLSIGYSSVRXLVKI^AVQFMIJ^TSBHFSPLG^ 
QSNLTDMRCRTTJm^ALGItfJJIVD^ 

AVAQMFSTNSFNEQEAKRTLVGLVRDLRGIAFAFKAKTS PMMLF 
BWIYPSYMPlLQRAIEfcWYHDPACTTPVIja^ 
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SEQ 
H> 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

ECS J. GRlc OI 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AfaAlanine, C=Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
HoHistidine, I=Isoleucine, 10- Lysine, 
L=Lencine, M-Methionine, N=Asparagine, 
P~Proline, O^Glutatnine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrooine, X -Unknown , *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








FDVSSPNGII^PT^ETSKMITMYGKRILTLGEVPKDQVYALKLKG 
ISICFSMLKAALSGSYVKFGVFRLYGDDALDNAIiQTPIKLLDSI 
PHSDLLDYPKI^QSYYSLLKVLTQDHMNFIASIiEPHVIMYIIjSS 
ISEGbTAIiDTMVCJX3CC3CIJ}HIVT^ 

ESDRFIiHIMQQHPBMIQQMIiSTVliNIIIFEDC3mQWSMSRPIXG 

LIIxLNEKYFSDIJWSIVNSQPPEKQQAMH^ 

TKNRDRFTQNLSAPRREVNnSMICNSTYGVNSNDMMS 


5469 


134 


2653 


DQEFSTSI*VPWHI>PMGWLCSGliFPVSCLVIiI^V7^SGNMim^ 
BPTCVSDYMS IfiTCBWKMNGPTMCSTELRLI»YQLVFI»LSBAHTC 
VPEm^GAGCVCHIJ^DWSADNYTI^LWAGQQLLWKGSFKPS 
EHVKPRAPGNL1VHTNV5DTIjIJ/TWS 

IWSEITOPADFRIYNVTYI^PSLRIAASTI.KSGISYRARVRAWAQ 
CYMTTWSEWSPSTKWHNSYRBPFEQHliLGVSVSCrrVIIAVCLL 
CYVS ITKIKKEWWDQIPWPAjRSRLVAIIIQIIAQGSQWEKRSRGQ 
EPAKCPHWKNCLTKUjPOTISHNMKRBBDPHKAAKEMPFQGSC^ 
SAWCPVEISKTVLWPBSISVVRCVEL7EAPVECE3EEEVKEBKG 
SFC^SPESSRDDFQBGREGIVARLTESIiFLXlLLGBENGGFCQQD 
hJGBSCTLLPPSGSTSAHMPWDEFPSAGPKBAPPWGXKQPIiHLEPS 
PPASPTQSPDNTiTCTBTPLVIAGlgpAYRSFSNSLSQSPCPRBLG 
PDPLIMHLEBVBPBMPC^PQLSEPTTVPQPEPETWEQI LRRNV 
IjQHGAAAAPV^PTSGYQEFVHAVE(XXnX)ASAVVGLGPPGEAG 
YKAFSSIddAS SAVSPEKCGFGAS SGEEGYKPFQDLI PGCPGDPA 
PVPVPLFTFGLDRBPPRSPQ5SHLPSSSPEHLGLEPGEKVEDMP 
KPPLPQEQATDPL VDS LGS G rVYSALTGHLOG^KQClfGQEDGG 
Crn^VMASPCCCCC^imASPPTTPlsRAPDPSPGGVPLEASLCPA 
SLAPSGISEKSK3SSSFHPAPGNAQSSSQTPKIVNPVSVGPTYM 
RVS 


5470 

• 


17 


1418 


TACRIRTSIiNRGIAAVKROAVEMlASYGIjAYSLMKFFTGPMSDF 

KNVGLVFVNSKRDRTKAVLCMWAGAIAAVFHTL IAYSDLGY YI 

INKIJIHVDES VGS KTRRAFL YLAAFP FMDAMAWTHAG I LLKHKY 

SFLVGCASISDVIAQVVFVAJLLHSHLBCREPLLIPrLSLYMGA 

LVRCTTLCLG YYKNIHDI I PDRSGPBLGGDAT1RXHLS FWMPLA 

LIIJVTQRISRPIVNLFVSRDI^SSAATFAWILTATYFVGHMP 

YGWLTK I RAVY PAFDKNNPSNKLVS TSNTVTAAH I KKFTFVCMA 

I^TI^VMFWTPI^EKILIDIIGVDFAFABLCWPI^IFSF^ 

PVPVTVRAHLTGWLmXKKTFVI^SSVI^IIVLIA^ 

GVIHZATI^GVGSLJJXGFVGKSIW^AIA^CYVYRKQKICKMKNESAT 

EGEDSAMTDMPPTEEVTDIVBMREENE 


5471 


1868 


658 


RSSAPPGPQRAAAATAAAAAAGVKMAAAAAQGGGGGEPRRTEGV 
GPGVPG^VBMVKGQPFDVGPRYTQLQYIGF^AYGMVSSAYDH\TR 
KTRVAIKKJSPFEHQTYCQRIT^IQILLRFRHKNVIGIRDILR 
ASTLEAMRDVY I VQDLMBTDL YKLLKSQQLSNDH I C YFTjYQ I LR 
GLKYIHSANVLHRDLKPSNIXItm*CDLKICDFGrJ^IADPEHD 
HTSFI»TE YVATRH YRAPB I MLNSKGYTKSI DI WS VGC ILAEMLS 
NRPIPPGK^YIJ3QLNHir/^I7^^PSQKDLNCIINMFARMYIX)SL 
PSKTKVAWAKLF^KSDSKALDI^DRmTFNPNKRITVF^ALAHP 
YLEQY YDPTDEP VAEE PFT FAMELDDLPKERLKELI FQET7VRFQ 
PGVLBAP 


5472 


1469 


753 


LYVMARYIjSDEEVA VS I DRLCKANGRS PS I PFGTVRI PGRARVR 
DPQALW I FG YGS LVWRPDFAYSDSRVG FVRGYSRRFWQGDTFHR 
GSDKMPGRVVTU,EDHEGCTWGVAYQVQGEQVEKALirm^FJV 

IATQin^CttGFSGHNLBYIiraVRIW^ 
VGTMLPCFCPTEQALALV 


5473 


3 


2113 


FMNVKI>LIQDLBDlEQRVPVMDAQyKIITKTAHLITKESPQBBG 

KEMFATMS KLKEQLTKVKECY S PLL YBSQQLL I PLB ELEKQMTS 

PYDSLGKINEIITVLBRBAQSSALPKQKH^^ 

LIEWGSOSVQICFVTLSWIJCHF^QTRLQRQIADIHVAPQSMV^ 

TGDWKKirVETNSRLMKKFEBSRAELBKVr.RlAQfcX3Jah'EKGDPEE 

LIJ^HTEFFSQLT^R VI^B\FT>KACDEL'rD I LPEQEQQGLQEAVR 

KLHKQWKDLQGEAPYHLimKIDVEKNRFIiASAEECRTR^ 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cystelne, D^Aspartic Acid, E=» 
Glutamic Acid, F=Phenylalanine, GMSlycine, 
H=Histidine, I^Isoleucine, K-Lysine, 
L=l»eucine, M«Hethionine, N=Aaparagine, 
P=Proline, Q-Glutamine, R=Arginine, 
S-Gerine, T=Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KLMPQEGSEKI IKEHRVPPSDKGPHHLCBKRI^LIBBLCVKLPV 
W>PVRDTPGTCHVTLKBLRAAIDSTYRKLMBDPDKWKDYTSRPS 
BFSSWISTNETQLKGIKGEAIDTANHGBVKRAVEEIRNGVTKRG 
ETTjSWLKSRLKVTjTEVS 8E>JEAQKQGDELAKLS S S FKALVTIXS 
EVEKMLSNFGDCVQYKEIVKNSIiBELI^ 

NLF EACXJLLIiHHQQKTJCR ISAKKRDVQQQ IAQAQQGEGGLPDRG 
HKKI^RKiBSTl^t^RSRERQERRIQVTLRKWRRFETNKETVVR 
YLPQTGSSHKRFLSFSSI>ESLSSELKQTKBFSK11TBSIAVQAEN 
LVKEAS EI PLG PQNKQLLQQQAKS I KEOVKKLEDTTiBBB YVIDK 
S 


5474 


2 


i 780 


TPD VRQLQASRRG IAVASWCBPRWFAGEKMAFVKS3WI»LRQSTI 
LKRWKKNWFDLWSDGHLIYYDIXrrR^ 

QKCRDTQPFDGKS KDCMLQ I VCRDGKT I SLC^KSTDDCLAWKFT 
LQDSRTNTAYVGS AVMTDETSWS S PP P YTAY AA? APE VGRTLS 
LQQAYGYGPYGGAYPPGTQVVYAANGOAYAVPYQYPYAGLYGQQ 
PANQVX rRBRYRDNDSDtAI^MIiAGAATGMALGSLFWVF 


5475 


2 


506 


ARGWIiBSLSLTCOOTPPPSSPCLIiHSPBTFIHTMPPNLTGYYRF 
VSQMOI BDYLQALNISlAVRKXAEiIiLKPDKEI EHQGNHMTVRTIj 
STFRNYTVQFDVGVEFEEDI^VDGRKCQTIVrWKRBliliVCVQK 
GBVPNRGWRHWLEGI^LYIil&TARDAVCKQVFRKVR 


5476 


192 


1457 


5TSMSU^FCrSRTG^SIJU>BKQSKTSIHQYLVDBPTLSWSR 

PSTRASF^/I^TRVSHYBLQVEIGRGFDNLTSVHLARHTPTGTrj 

VTOKlTNIiENCNBBRLKALQKAVIIfSHFFlW 

SHIiWVI S P FMA YGSASQIiLRTYFPEGMS BTLIRNI IjFGAVRGLN 

YI^QNGGIin^ IKASHILISGDGLVTI»SGLSHIiHSIiVKHGQRHR 

AVYDFPQFSTSVQPWLSPELLRQDJjHGYNVKSDI ysvgitacel 

ASGQVPFQDMHRTQMIJWJKIiKGPPYS PLDIS IFPQSBSRMKNSQ 

SGVUSGIGESVI«VSSGTHTVNSDRIiHTPSSKTPSPAFPSLVQLC 

UXJDPEKRPSASSLI^SHVFFKQMKEBSQDSILSI.LPPAYNHPS I 

SliPPVLPWTKPECDFPDEKDSYWBF 


5477 


3 


1044 


RGNSRLRYSHEDELQLPRIiPEIiFBTGRQI»I»DBVEVATEPAGSRI 

VQEKVFKGIJ>LLBKAAEMI»SQIiDIiFSRJ^DIjEBIASTX)LKY^ 

PAFQGALTMKQVWPSKRLDHLQRAREHF INYLTQ CHCYHVAE F3 

IJKTMIWSAEl^TWSSI^YPSLVAinASQROAKIQRYKaKKELE 

HRLSAMKSAVESGOADDERVRKYYLLHLQRWIDISLEEIBSIDQ 

EIKILREl^SSRRASrSNSSRQERPFVKPPILTRNMAQAKVF 

GYPSI.PTMTVBDWYBpjKRKYGAI^ 

EOKKKEEBDDEQTLlniAREWDDWKDTHPRGYGNRQNMG 


5478 


2 


835 


KTVRIWVPNVKGESTVFRAHTATVRSVHFrc^ 
VXVWATHRQKFLFSLSQHINWVRCAKFS PDGRLI VSASDDKTVK 
LWDKSSRECVHSYCEHGGFVrYVDFHPSGTCl^^ 
DVRTHRLLQHYQLHSAAWGI^ FHPSGWYliiTASSDSTLKILDl. 
MEGRLL YTLHGHOGPATTVAFS RTGEYFASGGSDEQVMVWKSNF 
DIGDHGBVTKVPRPPATLASSWGNLTVS ILEQRLTLKEDKLKQC 
LBNQQlilMQRATP 


5479 


2 


835 


KTVRJJJVPNVKOTSTVFRAHT^TVRSTO 

VKVKATHRQKFLFSLSQHINWVRCABCFS PDGRL X VSASDDKTVK 
I»WDKSSRECVHSYC^GGFVTYVt>FHPSGTCIAAAGM)NTVKVW 
DVRTHRUiQHYQLHSAAVNGLS FHPSGNYLITASSDSTLKILDI. 
MEGRIXYTLHGHO^PATTVAPSRTGBYFASGGSDEQVMVWKSNF 
DIGDHGEVTKVPRPPATIiASSMGlJT.TV^ T T.'PORr.TT.'R'RTiTrr.Wof 
LHNQQLIMQRATP 


5480 


444 


1952 


LSLTSRMEEABLVKGRIiOAITDiaiKIQBEISQKRliKIEBDKLK^ 
QHLKKKALRE KWnLDG I S SGKEQEEMKKQNQQDOHQ I QVLEQS I 
IJiIrf^IG#IJ2ieAEI#IST^^ 

EREERAEES IEDIYANI PDr,PKSYIPSRLRKE INEEKEDDEQNR 
KALYAMEIKVEKDI^TGEST^SSIPLP^DDFKGTGIKVYBDGQ 
KSVYAVSSNHS AAYNGTIX5LAP VEVBKIJjRQASERNSKS PTEYH 
BPVYANPFTRPTTPQRETVTPGPNFQERIKIKTNGLGIGVNESI 
HKMGJTCLSEERGNNFiraiSPIPPVPHPRSVTQOA^KI^rrTQKR 
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SEQ 
ID 
NO: 


" Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
reBidue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, Phenylalanine, Gt=Glycine, 
H^Histidine, I^Isoleucine, K»r.ysine, 
u *Jc=i*k»A*its t n — picuiuQniuaf w^Asparacr ine / 
P-Pxoline, Q=Glutamine, R=OVrginine, 
S-Sexine, T- Threonine, V=Valine, 
H=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








IiMTPWEKSNVMQDKDAPSPKPRLS PRETI FGKSEHQHSSPTCQE 
DBE0VRYNI VHSLPPDINDTEP VTMI FMGYQQAEDSEEDRKFLT 
GYDGI IHAELWIDDEEBEDEGEAEKPSYHPIAPHSQVYQPAKP 
TPLPRKRSEASPHEKHK3 


5481 


3 


1422 


NSPGSVCLCQCVCPSI^HCLPPLLLLLLLPMil^ESPQPPALRV 

vai ooukdi rwNiUiUJU* vu Ai^KFJULTaG^EKEKFB PTVFRDTLV 

QGLNKAGDDLEAVAKFIiDSTGSRl^ 

PGGlltEDIX^KTKHTKHCVraANBDHETIMiyAQfV^ 

YLEKAFEDEMKKLLLFLKAFSETEO/TKLA^^ 

I ItTS LFTD3LVKEG lAASFAVKLFTCAWMAEKDAM^VTSSIiRKAN 

LDXRI/LEI»PPVNRQSVDJIFAKYFTDACn/KEIiSDFrjlVCK)SI£T^ 

KEIiQKEIiQBRLSQECPIKEVVLYVKEEMKRNDLPETAVIGnOI^V^ 

CIMMAVBWinCKBELVJ^OALKHI.KQYAPI»IAVF^ 

UKVtiE YC xDN IHJb^KAFQKIVVLPYKADVI^KKAlLKWYKEAHV 

AKGKSVFXDQMKKFVBWLQNABEESESBGEEN 


5482 


1492 


528 


THVVMTGMCYAFHQVLS Y I NGVTTS KPGVSLVYSMPSRNLSLRIj 
EGLQEKDSGP YSCSVNVQDKQGKSRGHS IKTLELKVLVPPAPPS 
CRLQGVPHVXSANVTLSCQSPRSKPAVC^C^RQLPSPQTFFA^ 
LDVIRCKLSIiTOT^SSMAGVYVC^^ 

GAAWAGAWGTLVGLGL LAGLiVLliYHRRGKALBEPAND I KEDA 
IAPRTI^WPIKSiyriSKNGTLSSVTSARALRPPHGPPRPGALTP 
TPSLSSQALPS PRLPTTDGAHPQP IS PIPGGVSSSGLSRMGAVP 
VMVPAQSQAGSLV 


5483 


1 


788 


PFPFKGORAGRGNESDYRKIiEEMHQRFLVSERSKDDLQLRLXRA 

ENRI KQLBTDSSEEISRYQEMIQKLQNVI/BSERENCGLVSEQRIi 

KIiQQENKQLRK3TESLRKIAIiKAQKKAKVKISTMEHEFSI 

FBVQriRBMEDSNRNSIVEIjJUlLIjATOXJKAANRWKEETKKljTESA 

RlRINNLKSELSRQKIJiTQEIJ^QLEMANEK^AE^KLILEHQE 

KANRI^RRLSQAEEIUUISASG^LSVITVQPJRKAA^ 


5484 


3 

- 


1997 


IMADMEDLFGSnADSBAERKDSDSGSDSDSDQEHAASGSNASGS 

ESDQDERGDSGQPSNKEI.FGDDSBDEGASHHSG3DNHSERSMJR 

SEAS ERSDHED1TOPSD VDQHSGSEAPNDDEDEGBORSDGGSHHSB 

ABGSBKAHSDDEKWGRBDKSDQSDDEKIQNSDDEERAQGSDEDK 

LOWSDDDEKMQjmiDEBRPQLSDDERQQLSKEEKANSDDERPVA 

SDKDDl^QNSDDBEQPQI^DBEK^NSDDERPQASDEEHRHSDD 

n cLt^DWKbis i AKt^JJSElJEVIjRMKR KHAIASDSKADSDTEVPKD 

NSGTMDIjFGGADDI SSGSDGBDKPPTPGQPVDENGLPQDQQEEE 

PIPETRIEWIPKVmiJLGiroLYFVKLPNFLSVBPRPFDPQYYE 

DEFBDEEMI^EEGRTRLKIJCVEm'IRWPaRRDEEGNEIKES)^ 

IVKJISDGSMSliBlXaNEVPDVYKAPliOXroHNHLFlR 

VFKTKLTFRPHSl3)SATHR13iTI^ 

PECQRTEMIKKEEERT.RAS IRRBSQQRRMRBKQHQRGLSASYLE 

PDRYDEEEEGEES ISLAATlURYTCRflTRRRRM) T VQQTYQncftCB 

EDKAQPJljIiKAKXLTSDEVRPKLFWSRGLSCTQEPTALNEEL 
AGTN 


5485 


161 


1074 


KRK I LS 5MMDSEAHEKRP P I1»TSS KQDI S PH ITNVGEMKHYLCG 

CCAAFNNVAITFPIQKVLPRQX}LYGI 

RGILPPLMQKTTTIJUjMFGLYEDI^ 

VI AGTTEAI PTPI^RVQTLLQDHKHHDKFTNTYQAFKALKCHG I 
GEYYRGLVPI I^FRNGLSNVLFFXJLITCPI KB 
DFICX^LLGAMIX3FLFFPINVVKTRIQSQIGGEP05PT 
WLERDRXLINIjFRGAHIjNYHPvSLISWGIIl^TYBrXIaKVI 


54 B6 


1404 


142 


IPGSTISWSPJyVARGI>SVCRCCRj^PASAMDLFGDl»PEPERSPR~ 
PAAGKEAQKGPI^PDDIiPPASSTDSGSGGPLLFDDLPPASSGDS 
GS LATS I SO^VKTEGKGAKR^SEEEKNGSEELVEKKVCKAS sv 
IFGLKGYVAEPJCGEREF^DAHVII^DITBECRPPSSLITRVSY 
FAVFDGHGG IRASKFAAQNLHQNLIRKPP KGDVI S VBKTVKRCL 
LDTFKHTDEEFLKQASSOJCPAWKIXISXATCVI^VDN I I»YTANLG 
D8RAI LCRYNE BS QKHAALS LS KEHNPTQYEKRMRI QKAGGNVR 
TCRVIjGVLBVSRS IGDGQYKRCGVTSVPD I RRCQLTPKDRFI LL 
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SSQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F^Phenyl alanine, G=Glycine, 
H=»Histidine, l=Isoleucine, K=I»ysine, 
I*=Iieucine, M=Methionine, N^Asparagine, 
P=Proline , Q^Glutamine, R=Arginine, 
S=*Serine, T-Threonine , V«Valine, 
^Tryptophan, Y=Tyrosine, X^XJnknovm, *=*Sfcop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ACDGLFKVFTPEEAVWPILSCJ^IJKKIQTREGKSAADARYEAACJ 
NRLANKAVQEGSADNVTVMVVRIGH 


5487 


535 


182 


AVSI^IRGIWPAPVPLPLQPCPSNCDMBRVTI^IXLAGLTA 
LRANDPFAMKDDPPYYDWKIS^QLSGIilCGGIiItAIAC XAAVLSGK 
CKCKSSQKQHS PVPBKAI PLITPGSATTC 


54B6 


1072 


259 


AMAASGBPQRQWQKEVAAVVWGSCMTDIjVSLiTSRLPKTGKTIH 

GHKFFIGFGGKGANQCVQAARLGAOTSIWCXV^ 

LKQNDISTK FTYOTKDAATGTAS 1 1 VNNEGQNI I VlVAGANLLIi 

NTEDLRAAANVI SRAKVMVCQLB I TPATSLEALTMARRSGVKTIj 

PWPAPAJADLDPQFYTLSjDVFCCNESEAEILTGLTVGSAADAGB 

AALVI*LKRGC^VVIITIAIASGCVVI»SO^PEPKHIPTC 

TTV5FKI 


5489 


81 


893 


GKG P VAAFI DQSNI FLTDPXI FJjGOWREEPKMPLIJjLGJS TBPLK 
LBRDCRSPVEPWAAASPDIJVrACLCHCQDLSSGAFPNRGVLGGV 
I.FPTVEMVIKVFVATSSGSIAIRKKO^EVVGFI^ANKIDFKEIiD 
IAGDEDNRRWMRENVPGEXKPQNGI PLPPQI FNBBQYCGDFD3F 
FSAKEE1TCIYSFIX3LAPPPDSKGSEKA3EGGETRAQKEGSBDVG 
NIiPEAQBKNEEEGETATEETBEIAHEGAEGEAEEEEETAEGEEP 
GEDBDS 


5490 


Bl 


893 


GKG P VAAF I DQSNI FT.TDPK1 FLGQVmBEPKMPIiLLlvGBTEPLK 
LBRDCRS PVE P WAAASPDLAIACIX^CQDI^SGAFPNRGVLGGV 
LFPTVEMVT KVFVATSSGS IAIRKKQQBWGFLBANKIDFKBLD 
IAGDEDNRRWMRENVPGBKKPO>raiPLPPQIFNEEQYCGDFDSF 
FSAKEEHIIYSFLGLAPPPDSKGSEKAEEGGBTBAQKEGSEDVG 
NLPBAQEKNEBEGErATEETEEIAMBaAEGEAEEEEETABGBEP 
GEDBDS 


5491 


204 


1194 


GSAPRI^IiGPTGAQARDPDWWARPPSRPYTQSKEDRPDTBGRSE 
QGDMASSFXiPAGAITGDSGGELSSGDDSGEVEFPHS PEIEETS C 
LAELFEKAAAHIOTLIO^^REQr»LYLYARYKQVKVGWCOTP^ 
SFFDPEGKX}KKBAWKAJbGDSSPSOAMQEYIAVVKEIJ)PGWNPQI 
PEKIKGKEANTGFGGPVISSIiYHEETIEEEDKNI FDYCRENN IDH 
ITKA I KS KNVBV1WKDEBGRAIJjHHAC33RGHKEL^ 
ZNGQDNEGQTAIiHYASACEFIiDIVBIiLLQS 
BBVTGCKTVSLVLQRHTTGKA 


5492 


3 


1896 


ASKNPLSAVCTTGIMSSLAVRDPAMDRSIJ^VFVGWtPYEA^BE 
QIiB0)I FSEVGSWSFRLVYDRETGKPKGYGFCEYQDQETALSAM 
RNL^REFSGRAI*RVDNAASBKNKEELK5 LGPAAP I IDSPYGDP 
I DPEDAPES I TRAVASLPPEO^ FBLMKOWKIj CVGNSHQEARNML 
I/^^IAYAXJ^O^VVMRIWDPEOUjKILHR^ 
SVSVSGPGPGPGPGLCPGPNVLLNQGJNPPAPQP^ 
PPI^TPIOGGIPAPGPIPAAVPGAGPGSLTPGGAMQPQLGMPG 
VGPVPl^GQVD^DPRAPIPRGPVTPGGLPPRGtXGDAPNDPR 
GGTIXSVTGPA/EPRG YLGPPBX^P PMHHASGIIDTRGPS SHEMRG 
GPIX^PRL1*IGEPRGPMIDQRGLPMDGRGGRDSRANETRAMETE 
VI£TRVMERiiGMETCAMETRGMBARGMDARGLEMRGPVPSSRGP 
MTGGIOGPGPINIGAGGPPQGPRQVPGI SGVGNPGAGMOGTG1 Q 
GTGMOX3AGItfH^O<^tC<SVSlO^ 

PSSFSPGQSQVTPQDQEKAALIMQVLQLTADQIAMLPPEQRQS I 
I>ILKEQ IQKSTGAS 


5493 


1 


1876 


RAPMMTKAVPE3PRKPGRLTOAINSPLTWEHVWICVPGGTPDOC* 

TOTFRVKRPBlRRSASNGHVPGTPVYREKEDKrDEIIELKTCSI*H 

VQKSDVDIJffiTXLRRLBEENSRJCDRQIEQLIiDPSRGTDFVRTLA 

BKRPDASWVINGtKQRILKLBXWCKEKDGTI SKLQTDMKTTN^ 

EMRIAI^TYYEBVHRI>COTiLASSETTGKKPIX5EKK^ 

GSALI^LSRSVQEl>TBF^QSLKRDLDRVI>S^^ 

SKPRIJ^IVFJ^KKX^VMESSKSHAAF^VR^HPPACLASSSAL 

HRQPRGDRNKDHERIJ?GAVRDLKEERTAI^EQI^RDI»E^^LL 

QAKADLBKBLECAREGE EERRERE EVLRE E lOTLTSKLQELQEM 

KKBEKEDCPEVPHKAQEIjPAPTPSSRHCEQDWPPDSSBEGLPRP 

RSPCSDGRRDAAARVLQ^AQWKVYKHKJCKKAVLDEAAVV^ 
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n> 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, OsCysteine, D=Aspartic Acid, B» 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H-Histidine. l=Isoleucine, K=I»ysine, 
L=Leucine, ^Methionine , ifeAsparagine, 
P=Proline, Q=Glutaraine f R=Arginine r 
S=Serine, T=Threonine, V=»Valine, 
W^Tryptophan, Y^Tyroaine, Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\ "possible nucleotide insertion) 








GHLTRTKLLAS KAHGS EPPS VPGItPDQSSPVPR VPSPIAQATGS 
PVQKE^IVIIQSALRAHIJaiARHSATSKRTTTAASTRRRSASAT 
HGDASSPPPLAALPDPSPSGPQAVAPLPGDDVNSDDSDDIVIAP 
SLPTKNPPV 


5494 


71 


536 


RS KAKIGTPTREVPSTDMXVRRESSSSLTHRPAPS PATPRIiLGT 
RRVWjGVSEGTGC^DAMKLVLWLCSLLAPMV^ 
PFHYDYQTI.RIGGLVFAVVriFSVGIIit.ILSRRCKCSPNQKPRAP 
GDKKAOVENLITANATEPQKAEN 


5495 


273 


2168 


DSLLLIQViyrHPFTI^rJiSRLPSAIRSLILQKKPNlRNTSSMAG 

klrpaslvvlprslapaferf<^vntgplpllgqsepbkwmlpp 
qgaisetri^ghpofwkybfgactgsiiaslbqysbqlkdmvaffl 
gcsfslebalekaglprrdpaghsqiw^yktw 
lvvlmrplpkdklieglvraccsix^bqgqpvhmgdpbliigi khl 

SKPAYGmMVCTPTSEVPVFWPSPLTSLGAVSS^ 
CrVMTDLKDAKAPPGCLTPERlPKVHHISQDPLHYSIASVSASQ 

KIRFJUESMIG idpgnrg ighixckdbllkaslsi>shars vli tt 

GFPTHPNHBPPBBTDGPPGAVALVAFI^AI^lO^AXr^RAWN 
LHQKIVKDAVKQGVLKTQIPILTYQGGSVEAAQAPXCKNGDPQT 
PRFT)HLVAIRRAGRAADGNYYNAR KMKTIKHIfVDP IDDLFLAAKK 
I PG 1 3 STGVGDGGNELGl^KVnCEAVRRHIRHGDV I ACDVEADPA 
VIAGVSNWGG YALACALY 1 1* YS CAVHSQYLRKAVGPSRAPGDQA 
WTQALPS VI KBEKMbGI I>VQH KVRSGVSGIVGMKVDGLP FHNTH 
ABMIQKI.VDVTTAQV 


5496 


3 


2408 


QDTKMHKIYKGNI TPQLNKNTLKT8AATDVWAVYF3QFW I DY3G 

MKSGKGRPISPVDSPPl.SIwrCXJPTRYAESQKBPQrCNQVSINT 

SQSBSSDtAGRLIO^KKrJJCEyySTESEPLTNGGQKPSSSDTFFR 

PSPSSSEADDILIjVHVHISWSMQ^NHYQYXI^ 

NLRKDVBAVTGS PASQTS ICIGILtiRSABLALLLHPVDQANTLK 

SPVSE^VSPVVPDYIiPTRNGDFI^SKRXQrSRDINRIRSVTVNIi 

MSDKRSMSVDLSHI PLKDPLLFKSASDTNLQKGIS FMDYLSDKH 

I^KISEDESSGLVYKSGSGEIGSETSDKKDSFYTDSSSVLNYR3 

DSNILS FDSDGNQNI LSSTLTSKGNBTI ESIFXABI>LLPBAASIj 

SFmDISKEETPPVRTIJCSQSGLSGKPKERCPPmiAPIjCVSYKN 

MKRSSSQMSLDTISLDSMILRFQ^LBSIXJSDSFtMFLBKGNKKNS 

TTNYRGTABS VKAG ANLQNYGETS PDAI STNSEGAQENHDDLMS 

WVFKITGVNGBIDIRGEMBICI<JVNQVTPDQI^NIStJ»4YLC 

N RP VG SDQKA V I HS KSS PH I S I>R FESGPGAV I H S I/LAL'KNGFI>Q 

CHIKNFSTEFLTSSIJWIGHFI^ETVATVMPMKIQVSNTiaWL 

KDIKPPJ3STVSLKPAPVTVHII>HI*VVBR^ 

GltolJCENVKSDSVl^TSGKYDLKKD^ 

ANFPBPSFDFTHBQLMBBiroSItKQEajAKAEMALA 

HHIKKMTVB 


5497 


1821 


3308 


SIS KLLKRRSMIDAYIiIiSNSCAFFAPRl*FSLASQI IRBQQSPNV 
CFrYKYSGFPSr.BCQCHFVSPHSSCYINFFSFPPPFFVCFQLSN 
GPSHYSLSSESHVGPTGAGLFPHCLPASRLLPRVTSVHLPDYAH 
YYTIGPGMFPSSQI PSWKJJWAKPGPYDQPLWTLQRRKEKRBPD 
PNGGGPTTASGPPAAAEEAQRPRSMTVSAATRPGKEMEACEELA 
!*AI^P^I^IJ3rQRSSPJDSt^CSSGYST0/mPCX^EDTIPSQVS 
PYDYFSVSGDQBADQQBFDKSSTI PRNSD ISQSYRRMFQAKRPA 
STAGLPTTI^PA^IVTPGVAXIRRTPSTKPSWIRGTIGAGPIPIK 
TPVIPVKTPTVPDDPGVLPAPPDGPBERGBHSPESPSVGEGPQG 
VTSMPSSMWSGQA5VNPPLPGPKPSIPBBHR0AIPBSEAEDQBR 
EPPSAWSPC^IPESDPADLSPROTPQGEDMl^NAlRRGVKIjaCr 
TTNDRSAPRFS 


5498 


2434 


1492 


ILTHQEI FTGEKPCBCGKAS IQMSHLiSQQKI YSGEN PFACKVCG 

KVFSmCSm.TFJn^FHTRBKPFECNECGKAFSQKa^IKHO^mi 

TGEKLFBCl^CGKSFSQKBNLLTHQOHTGBKPFECKD^ 

QKSNLIRHQRTHTGEKPFVCTKECX5KTPSGKSKLTEHEKIH1GEK 

PFKCSE^GTAFGQKK^IKHQKIHTCBK^YECKECGKAFS 

L I VHVRIHSGDKP YECNVCGKAFS QS SS IiTVHVRSHTGE KP YGC 

NECGKAFSQFSTLAIJIIiRIfflGKKPYO^BCGKAFSQKS 
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NO: 


Predicted 
beginning 
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to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre s ponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=»Cysteine, D«Aspartic Acid, B= 
Glutamic Acid, F^Phenylalanine, G-Glycine, 
H=Histidine, I^lsoleucine r K»Lysine, 
L=fLeucine r M-Methionine, N=Asparagine , 
P=*Proline, Q=Glut amine, R=*Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine , X^Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








QKIHTH 


S499 


324 


926 


GFGQIGRGHKITTYPFSPRKSGRKGMAQSQGWVKRYIKAFCKGF 
FVAVPVAVTFIJ)RVAC\MVEGASMQ?SLNPGGSC^DVVIiIMI 
WKVRWFEVHRGDXVSLVSPKNPBQKIIKRV1ALEGDIVRTIGHK 
NRYVKWRGH1WVEGDHHGHSFDSNSK4PVSU5LLHAHATHILW 
PPBRWQJGUESVLP PERLPVQRBEE 


5500 


1978 


1286 


KPDWRI£J^PRI*YI*RSSRFGTOHLK^^ 
VKHKPVFIRl^TPGDRGVMMDISAPFFRDPPAPLGBPGKPFNELW 
DYBVVEAPFLNDI T^YIiBVBLCPHGQHLVLLLSGRRN^ 
PLSFRVSRGETKWEGKAYLPWSYFPPNVTKFNS FAIHGSKDKRS 
YBAIi YP VPQHBLOQGQKPDFHCIjE YFKS FNFhfTLIiGEEWKQPSS 
DLWLIBKCDI 


5501 


2927 


2226 


CRPPVSAR\>APGHQ(^VGGSGRRPARVBVVDAAARPSaRPFSLP 
AAlPuVALISRLIJWFRSLFWXBEiraLT^^ 
SGQPSEWIPTVGFNP1RKVTKGNVTI KIWDIGGQPRFRSMWERY 
CRGVNAI VYM IDAADRBK1 BASFJKBLHNLliDKPOI^Gl PVI»VLG 
NKRDI*PHALDEKC?I*I BKMNLSAIQDREI CCYS I S CKEKDNID I T 
LQWIilQHSKSRRS 




Q 

-J 


ft"? A 


GKFFKGGGSSKSRAAPSPGEALVRLRETEEMLGKKQEYLENRIQ 
REIAIJvKKHGTQNKRAALQAIJamaiFEi^jTQlMTLSTIEF^ 
REALENSHTOTEVLRNMGFAAiCAMra IT 
EQQDIAQB I SRAFSQRVG FGDD FDEDELMAELB ELEQEELNKKM 
TNIRIiPNVPSSSLPAQPNRKPGMSSTARRSRAASSQRABKEDDD 
IKQLAAWAT 


5503 


216. 


654 


KGVRRRGRVRSDS EDS HU3 Y FKMSFLLP KLTSKKEV DQAIKSTA 
KKVIjVIJiFX3RDEDPVCLQLDDILSKTSSDLSKMAAIYI*VDVDQT 
AVYTQYFDISYI PSTVFFFNGQHMKVDYGGEDPAJOR 8 IKAVRRT 
SPAGTLGBKPVKS 


S504 


58 


3563 


QLSFSFUAPVTFDDITVYXLQEEWVLIjSOQQKEIjCGSWKLVAP^ 

GPTVANPEI^FRKFGRGPF^WIXJSVOGORSLIiKHHPGKKQ^ 

FjiEVQGPTRESGQSLPPOKKAYTjSHIiSTGSGHIEGDHAGl^NRKL 

LKPRSIQKSWFVQFPWLIMNEEQTALFCSACREYPSrRDKRSRL 

IEGYTGP FKVETLKYHAKSKAB*IFCVNAIAARDPI WAARFRS IR 

DPPGDVLASPEPLFTADCP I FYPPGPIiGGFDSMAELI»PSSRABti 

BDPG<n)G&IPAMYIJX:iSDW*QKEI 

SCIQDPSAEQLSEBVPVVFEELPVVFBlJVAVYFTRBBWGMriDKR 

QKBljYRDVMRl^BUJ^I^PAAAia>DLISKLERRAAPWlKDP» 

GPXWGKGRPPGMKKMVAVREAimiASAAD^ 

CSSSlCEEGDGPRRIKRTYRPRSIQRSWFGQPPWLVtDPKETKI* 

FCSACXBR PNTjHD KS SRkVRG YTG PFKVETLKYHHVS KAHRLCV 

NTVB I KEDTPHTALVPE I SS DLMANMBHFFNAAYS IAYHSRPLK 

DFEKILQLLQSTGTV I LGKYRNRTACTQF I KYISETLKREIIJBD 

VRNS PCV8VJ.LDS3TDAS EQACVGI YIRYFKQMEVKES YlTLAP 

LYSBTADGYFET:TVSAIiI)BIjI>IPFRKPGWV^ 

GGLVBKFQE VI PQLLPVHC^AHRimAVVDACGS IDLVKKCDRH 

IRTVFKFYQSSNKRLNELQEGAAPLEQBI IRLKDLNAWWVASR 

RRTLHALLVSWPAIARHLQRVAEAG<^ 

VKFCK^LDFXSIYRPI^EVCXIKEI^ITEVNAII^ 

LIWQAGPKBEBFNASFKIX3RIjHGIC3^ 

LTGIBYLQQRFDADRP PQLKNMEVFDTMAWPSGIEIiASFGNDDI 
LNLARYEBCSLPlOTSEPJu^EBVn^LK^ 
AOHCRFPLLSKT^VVVCVPISTSCCERGFKAhlNRIRTDKRTKIi 
SNEVLNMLMMTAVNCWAVTBYDPGPAI^ 

OVQVPARJSPASARIJ^KEEMGALYVEEPRTQKPPir^SRFJVAEVL 
KDClMEPPERliLYPHTSQEAPGMS 


5505 


3312 


1219 


"NC^PRSLSAAKM5HRNMim»PSNLPQX^NLIKRDPP^ 
QYllffraSNVBIFlMPNKPSKELAELVMFMAQI 
PQEVTOIiI^amTVLDPDLRMTFCK^ 
FPEl^Cm>KIJ#3lKTLYTHIVTDIia?INAKH 
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residue of 
amino acid 
sequence 


Amino clC id S£ > eiTn*>nt > ' rrmt" a ini rtn e* -i /-mn 1 r%i-vT-\f- A A<*± 

(A= Alanine, C^Cysteine, D=Aspartic Acid, Bt> 
Glutamic Acid, F= Phenyl a lanine , G«Glycine, 
H=*Histidine, I^Ieoleucine, K=-Lysine, 
L^Leucine, Methionine, N=Asparagine , 
P-Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y«Tyrosine, X=Dnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








YTMLRDSNATAAKMSI^VMIEL YRRNIVWDAKTVNVI TTACFS K 
VTKI LVAALTFFIX5KDEDSKQBSDS ESEDDGPTARDLLVQYATG 
KKSSKNKKlQLKKAMKVljKiCHRKKKKPRVF^F^aTPT.TTrnpnnT^a 

^^^^^ vm, m uv t wiuuuuuuvRr u v rjiy *ru-m- l. r^lj ± X%LJr*i^l J I* >\ 

BKLLKQLECCKERFEVKMMLMNL1SRLVGIHELFLFNFYPFLQR 
FLQPHQRBVTKILLFAAQASHHIiVPPEI IQSLLMTVANNFVTDK 
NSGE VMTVG INAIKH 1 TARCPIAMTE PJJjQDIAQYKTHKDKNVM 
MSARTLIHLFRTU^PQMLQKKFRGKPTEAS IBARVQBYGELDAK 
DYIPGAEVI^SKEENAKNDBDGWESTSLSKEEDADGBWXDVQH 
8SDEB(XiE ISKK1NSMPMBBRKAKAAAISTSRVLTQKDFQKIRM 
AQMRKBIinAAPGKSQKRKTCEIDSPEBPRGBII^XJlDlERIjHKK 
PKSDKETRLATAMj^KTDRKBFVRIOCnCIWPFS 
NFMMMRYSQJTVRSKNKRSFREKQLAIjRJDALLKXKKRM^ 


5506 


1 


1531 


FRGDliCGQHGGSAPGEGGSSAWPAPAHPLPBRHRKRKAIiCiPGRS 

CSGGGGEETPGTT PVWS PI*EGGGDEELRPNP YVRFPYRWWAVW 

lAAFPSI/SAG^ETPEAPPBSWTQLffFFRPVVllAAGYASFMVPGY 

IXVQYFRRKNYI1ETGRGLCFPLVKACVFGNEPKASDEVPI1APRT 

BAAETTPMWQAIiKLLFCATGIiQVSYLTWGVIiQERVMT^ 

TSPGERFTDSQFLVIAlKRVLALIVAGr^CVIjCICQPRHGA^ 

SFASLSNVLSSWCQYEALKFVSFPTOVIJ^ASKVIPV^ 

SRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLIL 

LAGYIAFDSFTSNWQtBuVFAYXMSSV^ 

LEGXJALIjBGTRFMGHHS H FAAHALLLS ICS ACGQLFI FYTI GQP 

GAAVFTIIMTXdRQAFAlW^O^TC 

LRVYARGRLKQRGKKAVPVB3PVQKV 


5507 


3704 


1271 


FRGTRRCRPAGRASRRARRRPPCPGPAAPGSLEIGGFGTAAGKK 
VAVADVQFGPMRFHC^LQVIJiVPTKEDNaCNGFi^CKKAGFK 
CTVH'KFJVOAVIAC FIJ3KHHDI 1 1 I DHRNPRQIJDAEALCRS I RSS 
KI,SENT\n:VGWRRVDREELSVMPPlSAGFTRRYVEI^NtMACY 
KEX^I*EFGEVRSQIjaJ^CWSVFTAl^NSSDAI2ITSE0RFIQ 
YANPAFF/TTMGYOSGELIGKEIiGBVPINKKKADLLDTINSClRI 
GKEWQGI YYAKKKNGDNIQQNVKI I PVIGQGGKIRHYVS I IRVC 
NGliNKAEKISBCVQSBlOT)!^ 

E VSSQRRHSSMARIHSMT I EAP ITKVINI INAAQBSSPMPVTBA 
NBYVLSTKNTQMVSSNIITPISUDD^ 

FKI^AATHNRJ>LrYMJWFARFGICEF^ IB 

ANYHSSNP YHNSTHSADVLHATA YFLS KBRI KBTjLDPIDB VAAI» 

lAATIHDVDHPGRTNSFLCltAGSELAlIjY^ 

QLTTGDDKCNI FKNMERNDYRTLRQG 1 1 DMVI^TF^ITKHFEHVN 

KPVNSIinCPIrfm»BE»GBTDKNO^VI^^ 

KCADVSNPCRPLQYCIEWAARISEKYFSQTDEEKQQGLPVVMPV 
FDRNTCSI PKSQ IS FTDYFITDMFDAWDAFVDLPDIiMQHLDNHF 
KYWKGIiDEMKLRNLRPPPB 


5508 


1151 


*91 


LSSVFSRRSASMFAVGCSWJPFUfYWYLgLDRLPPASGLRGFPN 
VLKKVLVDQLVASPLLGVWYPI^IX^ 
EFYKADWC^WPAAQFVNFLFVPPQFRVTYTWGIjTI^WOT 
KYRSPVPLTPPGCVALDTRAD 


5509 


1238 


619 


RKSRGC*>NALSASGPAAAAAAIMVRXLKFHE^ 

VTDHbnjHELRVLPJiYRljQRRKDYTRYNG^RAVRELAR 

ERlXJPRVRASAALIJJKLYAlXniVP^RG 

LPTVLLKLRMAQHLQAAVAFVEQGHVRVG pdwtdpafl»vtrsm 
RDFVTWVDSSKIKREVLEYNEERDDFDLEA 


5510 


96 


1195 


PAGAHI*SSGSSEPLVEPGRGRVGARVKGERGLQASGSAPGRSKM 

AEGERQP PPDSS EEAP PATQNF I IPKKKIHTVPDMGKWKRSQAY 

ADYIGF I LTUJEGVICGKiajTFEYRVSEAIEKLVALIjmjDRW ID 

ETPPTOQPSRFGNKAYRIWAKI^ERAENL^ 

E VAVYLKES VGNS TR I DYGTTGHEAAFAAFLCCX»C^G VLRVDDQ 

IAI VTKVFmYLBV^KI^KTYTWEPAGSOGVi^J^DFQFI.P FI 

WGSSQIiIDHPYLEPRHFVDEKAVNENHKJDYMFIiECILFITEMKT 

GPFAEHSNQLWNISAVPSWSKVN(^LIRMYKAECI»EKFPVIQHF 

KFGSliRIHPVTSG 
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to first 
amino acid 
residue of 
amino acid 
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Predicted end 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Ala nine, C~Cysteine, D^Aspartic Acid, B= 
Glutamic Acid, F»Phe»ylalanine, G»Glycine, 
feHistidina, I=Isoleucine , K=tysine f 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glut amine, R=Arginine f 
S«Serine, T=*Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *^Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


S511 


276 


1980 

• 


KLSRV1^I»PPBNLITSISAVPISQKBEVADPQLSVDSI»LBKDKD 

HSRPDIQVQAKRLAEKI^anWSBISTGQRTVNFKINRBLLTK 

TVLQQV I EDGSKYGLKSELFSGLPQKKI WBPSS PNVAKKFHVG 

HLRSTI IGNPlANLKEALGHQVIRimijGDWGMQ)EX3LLGTGFQt> 

PGYBEKLQSNPI^HLFE^^QVNKBAADDKSVAKAAQBmjRI^ 

LGDVOALSLWQKPRDLSIEBYIRVYKRI.GVYPDEYSGESFYREK 

SQSVhKhhSSKGhlAjKTXKGTAVVDl^mDPSSXCrrVMRSDGT 

SLYATRDIiAAAIDRMDKYNFDTMIYVTDKGQKKHPO^VE\)ML^ 

MG YDWAERCQHVP FG WQGM KTRRGDVTPI JBDVLNE I QLRMLQN 

MAS 1 KTTKELKN PQBTABRVGLAALI IQDFKGUJjS D YKFSWDR 

VFQSRGDTIQVFliQYTHARI^ 

VSILQHLLRFDE^YTCSSQDFQPRHIVSYI^^ 

QI KDSPPBVAGARIiHIiFKAVRSVLAl^KLLGiTPVCRM 


5512 


120 


1015 


DPSIiLLTlTVTGVTVLVLVLKSMNSRRRBPITLQDPEAKYPLPL 
IEKHKISHNTRRFRFGLPSPDHVljGLPVGtr^QLLAKIDNBLVV 
RAYTPVSSDDDRGFVDLI IKIYFKHVHPQYPEGGKHTQYI»ENMK 
IGETIFFRGPRGRLFYlIGPGNIXJIRPDG/rSBPKKlTiADHLGMIA 
GGTGITPMLQLIRKITKDPSDRTRHSLIFAHQTBEDILVRKELB 
BIARTHPIJQFDLVJYTljDRPPIGHKYaSGFVTADMIKBniiPPPAK 
ST1»ILVCGPPPIjIQTAAHPNIjBKLGYTQDMI FTY 


5513 


2 


837 ~ 


ARWRLPSDSPRIPPAGAETPGRGSCRNYLPSSSPPPPEPSSFPS 

PPTSRGGPGSRD'rMSUSEEESQDRQLKIVVLGDGASGKTSIjTTC 

FAQKTPGKQYKQTIGJ^FFLRRITLPG^n^NVTLQIV?DIGGQTIG 

GKMLDKYI YGAQGVLLVYD ITIvYQS FBNLBDWYTWKKV5KESB 

TQP LVALVGNKIDI^HMRT IKPEKRXRFCQENG FSSHFVSAKTG 

DSVFLCTOKVAAEILGIKI^KAEIKQSQK 

RTVNPPRSSMCAVQ 


5514 


1295 
# 


449 


VNRPS W IMGN FRGHALPGTFFF I IGLWWCTKS I LKYICKKQKRT 
arLGSKXI.FYRIiEILEGITIVGMALTGMAGBQFI PGGPHLHLYD 
YK0^^NQLW5WHHFTlWFFFGXLGVADILCFTISSLPVSI,TK£i 
MLSNAL FVBAFI PYNHTHGREMLD I FVHQLIj VL WFIiTG IiVAFIj 
EFLVRNNVI,LEI>I»RSSiabI J C^WFFQlGFVLYPPSGGPAWDI^ 
DHHNI Ij FLTI CFCWHYAVTIVTVGMNYA FITVfL VKSRLKR I.CSS 
EVGI*LKNAEREQRSEEEM 


5515 


1572 


260 


FVRLVGRGDCDPU^VObTTMPLYEGLGSGGEKTAVVIDLGEAP 
TKCGFAGETGPRCI IPSVIKRAGMPKPVRWQYNINTEELYSYIj 
KKFIHILYFRHIXVNPRDRRWI IBSVLCPSHFRBTLTRVLFKY 
PEVpSVIJ^SHLMAIJ,TIiGINSAMVI^^ 
VLNCUGAI«PliQGKAUaCBliETQIJ^ 

S VPEGVLEDI KARTCFVSDLKRGLKIOAAJCFWIDGNNERPSPP P 
NVDYPUXSEKlIiHlLGS IRX)SVVEILFECJ3NEEQ^VA!!I1»ILDSI* 
IG^PIimiKQIiAKNLWIGGTSh4LPGFl«RLLAEIRYLVEKPKY 
KKALGTKTFR1HTP PAKANCVAWLGGAI FGALQDILGSRS VSKE 
YYNQ TGR I PDW CS LNNPPLEMH FTJVGKTQPPLMKRAF3TEK 


5514 


3 


735 


NSREPPQAOTGPSPRKSPTASSFI^PWRPIJ^FFmGWGX^ES 
IKAMWRVPGTTRRPVTGBS PGMHRPEAMl^UIjTIiALIX^PTWAG 
KMYGPGGGKYFST^TEDYDHEITGDRI/SVGLLLVKSVQVKIjGDSW 

dvklg alggn'rqevtlqpgk y i tkvfvafqaflrgmvmyts kdr 
yfyfgkldgqissaypsqegqvi.vg1ygqyqllgiks igfbwny 
pleeptteppvnltysanspvgr ■ 


5517 


246 


499 


SE I YVAMRTDSS KOTDVESGVANFASSARAGRRNALPD IQSS AA 
TIXJTSDLPLJa^EALS VKEDAKE KDEKTTQDQLEKPQNE EBC 


551B 


3 


1375 


DAHADAtmtftWDI^MDFPCLWLGLLbPLVAALDFNYHRQEGMEA 
FLKTVAQNYSSVTKMSIGKSVKGRNLV^ 
EFKYVAl^GDETVGREtIAUl>lDYLVTSDGKDPE ITiSTLINSTR 
IIaMPSmPIX^FEAVKlG?IXnrYSIGRK^miQYDLNRNFPnAFBY 
NNVSRQPBITOVMKWUOKTFVLSANI^ 

TGALYSRSLTPDDDVF^YIiRHTYASRNPlWKKGDECKNKMNFPN 
GVTNGYStfyPLQGGMQDYNYIWAQCFElTLELSCClCYPREEKLP 
S FWNNNKASIiIEYI KQVHLGVKGQVFDQWGN PLPWVI VEVQDRK 
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ID 
NO: 


beginning 
nucleotide 
Xo cat. ion 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 

conr^snondi nry 

to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A- Alanine, C~ Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 

L=Leucine, M=Methionine, N=Asparagine / 
P^Proline, Q=Glut amine, R=Arginine, 
S=iSerine, T=Threonine, V=»Valine, 
W=Tryptophan, Y^Tyrosine, X-Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=posaible nucleotide insertion) 








HICPiTrmKTGEYYIiLI^PGSYIINVTVPGHDPHITKVIIpEKS " 

QWFSALKOTILI.PFXX^bDSIPVSNPSCPMrPLYlWLPDHSAAT 

KPSI*FLFLVSLLHIFPK 


5519 


57 


A77 


X <U> AXaMyv" IS vytb ii W KJj A IIATAj Fx rivj r\t£ bO WDI^KAAVAAVVG 

G WAVOTVI,VAl*S AMQ FT3VGIAASS IAAKMMS TAA1 ANGGGVA 
AGSLVAILQSVGAAGI^VTSKVIGGFAGTAIXAWL^PPSS 


5520 






1? I'J&ttKUKV x»fc.i If X V f KijJUjAM 1 M"S> 1 tlL Y UKLi VXjbW I TrJjNxYI 

SQ^iWKVKPKTiJ^OI^^K)MTLl^IAJ^^l FTftJVTCLDDVLK 

RTKGGKDIKFl*TAFPJ}LLFxTIJVFPVS^ 

lYPKVLDTVIPVWwniAMHT^^ 

TLIJtfVASraYISRILWLYFETC^ 

Y VF IAS I YIix*jEKI*NHWKWV8VQII^ 

PAKHQLVKNIR 


5521 


546 


911 


KILNMQKSCKKNEGKPQNMPKAEEDRPtBDVPQEAEGNPQPSHE 
GVSQFABGNPRGGP1^PG<^FKEDTPVRHLDPKBMIRGVDKI 4 ER 
LREE IRRVRNKFVMMHWKQRHSRSRP YPVCFRP 


5522 


1224 


637 


GSRPJ^RSREKMWVFGYGSLIWKVD5WQDKLVGYITWYSRRF 
WQG STDHRGVPGKPGRVVTIjVBD PAG CVWGVAYRliPVGKEEEVK 
AYIiDFREKGGYRTT^VIFYPM^PTTTPFSVI^ 
PAPLEDIAEQI FNAAGPSGRNTBY^FBLANSIRNLVPEKADEHL 
FALEKLVKBRLEGKQNIJJCI 


5523 


3 


1280 


^KGkKR^SSMSAATARRPVFDDKEDVNTOHFQILRAIGKGSFG 
KVC I VQKRDTEXMYAMKYtWKQQC I ERDEVPOfVFTiEIiEILQBIE 
HVFLVNLWYS FQDEEDMFHVVDLLLGGT)LRY1ILQQNVQFS 
RLY I C BMALALD YLRGQHI I HRDVKPDNXLLDERGHAHLTDFNl 
ATIIKIXJBRATALSGTKPYMAPEIFHSFVNGGTGYSFBVDWWSV 
GVMAYEIJ^GWRPYDIHSSNAVESX*VQ1^STVSVQYVPTWSKEM 

vallr ku^Upbhrlsslq d vc^palagvlwdrt»sr kr pg 
fvpnxgrxihcdptfeleem i liesrplhkklckrlaknksrdns rd 

SSQSBNDYLQlK^^IG^DFVIFWREKx.KRSQDLPREPLPAPES 
RDAAEPVEDEAERSALPMCGPICPSAGSG 


5524 


85 


2318 


RERERT)HRPGBSSQGQ3GAGGCFPSP7ME1iRCGGLLFSSRFDSG"" 

NLAHVEKVESLSSIDGEGVGGGASALTSGIA6SPDYEFNVWTRPD 

OurrKFEWSNRSWFYFSVRGGMPGIM 

MAPFVRTLPTRPR WER I RDR PTFEMTETQFVLS FVHRFVEGRGA 

TTFFAFCYPFSYSIX^ELLRQIjJ3QRFPEt^ 

KIJ!»CYSJUDGLRVDl^TITSOTGLRBDREPRJ r «E^ 

RFAGKRIPFl»SSRVHl 3 GBTPSSFVPNGFLDFIIJiPDDPRAO/ri^ 

KJbr Vr JvAtA FMUS FIXs V VJ^HYKTDoRGVNLhRQ YliKPD 

IYGAKAVl^YHHVHSRIiNSQSSSEHQPSSCLPPDAPVSDLBiaUJ 

NxXJHEAG£XsHSADRHNABAWK^ 

ESAPDTI PPKl^GVAxTVDLHGHASKRGCFMYGNS FSDESTQVE 
NMLYPKLlSLNSAHFDFO<X!NF 

AI YKASGI raSYTLECtryWTGRS VKS I PAACHDNGRASPPPPPA 
RAWMLKHVRNSRGL^TLNVGVW 

NTLSRARSFSTGTSAGGSS33QQNSPQMKNSPSFPFHGSRPAGL 
PGLGSSI^KVTHRVI^PVRGKPVWEPIXJHVFG^ 


5525 


105 


834 


SiraJ)FERHLFIMGQQISIX}TQLVl^ia>PEKVAKHVTLVR^ 

LTYEBFI^VAELWDVTAKVASGQEKHIjLFEVQPXJSDSSAPWKV 

VVRVVCTKINKSSG1 VKASRIMNLYQF IQLYKDITSQAAGVLAQ 

SSTSEEPDBNSSSVTSCQASLWMGRVKQLTDEEECCrCMDGRAD 

IiII^CS^FCQKCIDiCWSDRHRN^ 

TEDDMANYILNMADEAGQPHRP 


552S 


3 


B53 


RRPCNPVRAAKRTGAAARA PRGLEVTMIiR VAWRTI*S Ii 1 KTRAVT 
QVLVPGLPGGGSAKFP Fl^WGLQPP^LlJWiAAP^YVVRKPAQSR 
IJ3DDPPPSTI^KDYQNVPGIEKVDDWK^^ 
KQEQFMKK1AZANPHDTRSLEARI IALSVKIRSYEEHLEKHRKDK 
AHKRYLI^SIlX3RKJG41jK1SILRN^ 
iTfltRAHRRFVlTCKAiClRVF 
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SBQ 
ID 

NO; 


Pare dieted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanane, C=»Cysteine, D=Aspar tic Acid, R= 
Glutamic Acid, Phenylalanine, G=Glycine, ') : 
H=Histidine, I=Isoleucine, K>I»yaine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=*Proline, Q=Glutaraine, R^Arginine, 
S=Serine, T=Threon±ne, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Dnfcnown, +=Stop 
Codon, /-possible nucleotide deletion, 
\ -possible nucleotide insertion) 








RRNPDSPAKAIPKTLKDSQ 


5527 


3225 


565 


LLRXYIJ,HQNPI^LRHQPNRTCISFSAT^ 

KFQTKG I KWG KWKEVK IDPNN FADGQMDDLVCFE B LTD YQLVS 

PAKNPSSLFSKEAPKRKAQAVSEEEEEBBGKSSSPKKKIKLKKS 

KNVATEGTS TQ KE PBVKD PELEAQGDDM VCDDPEAGEMTSBNLV 

QTAPKKKKNKGKKGLBPSQSTAAKVPKKAKTWI PEVHDQKADVS 

AW 10DLFVPRP VLRALS PLGFS APTP I QALTliAPAIRDKIiDI LGA 

AETGSGKTLAFAIPMIHAVIjO^QKRNAAPPPSKTEAPPGETR^ 

AGAKl^PGKAKABSDALPDDTVIESEALPSDIAAEARAK^^ 

vsdqallpgdddagegpsslire KPVPKQNENEEENIJDKEQTGN 

LKQEIiDDKSATCKAYPKRPIiGLVLTPTRELAVQVKQHIDAVAR 

FTGI KT?OI,VGG^TQKQQRMI^RRPEI WATPGlU>WEIiI KEKH 

I xiJUKTi LixCy JJtCv, U V VlJl&fVL>«wlVOA.VjJ»CrfVI>^ 

RQTLVFSATLTIiVHQAPARILHiaanrKKMDKTAKLDLIiMQKIGM 
RGKPKVIDLTRNEATVETLTETKIHCETDE10)FYLYYFIjMQYPG 
RSLWANSISCIIOH^GljIjKVroiMPI.T^ 
FARLBXK^^IATDVAARGLDIPKVQHVIHYQVPRTS E I YVHRSG 
RTARATNBG3USLMLIGPEDVINFKK1 YKTLKKDEDI PLFPVQTK 
YMDWKERI RLARQIEKSEYRNFQACLHNS WI EQAAAALEI KLB 
RnwrvVfimmnrv^CPD d i> rnrr\M in/r jtwv .u ut . t.ar* dt .itthq/^ vn*w 

owl I PXf u A/U^UUJSBKK1C|2 AJ^PI R. V iK HJUJjoU *■ -L) LI ISoyiV. J. It 

YP TQSGKPPLLVSAPS KSBSALS CLS KQKKKKTKKPKEPQPEQP 
QPSTSAN 


5528 


3 


895 


GPFLSACRMWGACKVKVHDSLATIS ITLRRYLRLGATMAKSKFB 
YVR D F^ADDTCLlAHCWVWRLDGRNFHRPAE khnfakpnds ral 

FMTHVASQFASSYVFYWRDYFEDQPLLYPPGF1X5RVVVYPSNQT 
LKDYX^WROADOTINNLYNTVFWALrQQSGLTPVQAQGRLQ^ 
AAD KNEILFS EFNIN YNNE PPM YRKGTVL I WQKVDEVMTKE IKL 
PTEMBGKKMAVTRTRTKPCKPSHLPRAPCLRWL 


5523 


48 


640 


TPRLVSAHIjKTRiCLINPEAAERRWRDWDSHQGWLSVKMQRVSGL 
LSWTLSRVLWMGLSEPGAARQPRIMEEKAIiEVYDLIRTlRDPE 
KPNTLEELEWSESCVEVQEINEBE YLVI I RFTPTVPHCS LAXL 
IGLCLRVKLQRCLPFKHKIjBIYISEGTHSTEEDINKQINDKERV 
AAAMENPNLRE I VEQCVLEPD 


5530 


4541 


2606 


AQIVHAISYCfflCLHVGHRDLRTENWFFB^ 

FQPGKKLl^SCGSlVVYSAPKILIiGDEYDAPAVDlHSUJVILFML 

VCGQPPFQEAm»BTLTMIMIX3CYTVPSHVSKBCKDU 

DPKRRASIiEE I ENHPWLQGVDPSPATKYNI PLVSYKNLSBEEHN 

SIIQRMVL/^DIADRDAIVEJU^ETNRYNlIITATYFI^IJlERILRKK 

QEKEIQTRSAS PSNIKAQPRQSWPTKI DVPQDLEDDLTATPLSH 

ATVPQSPARAADSVLNGIffiSKGLCBSAKKDDLPBIJ^ 

VTNRLTSRKSAPVIiNQ I FEEGKSDDEFDMDENLPPKLSRLKMNX 
ASPCTVHKRYHRRKSQGRGSSCSSSBTSDIJDSFSRRRLDKDSGF 
TYSWHRRDSSKGPPGSEGDGGGQSKPSKASGGVDKASPSENNAG 
GK^PSSGSGGNPTNTSGTTRRCAGPSNSMQLASRSAGELVBSIjK 
L^lX!LGSQLiIGSTKYIIDPONGl^PSSVKVQBKS'IWMClSST 

gnagqvpavggikffsdhmadtttbi,erik^ 
ektisvntqrnpkegllcasspascaivi 


5531 


24 


515 


GSQPRAPRPRDSRERPEPELIRQSWRAVSRSPLRHG'rVLFARLF 
ALEPDlXPDPQYNCRQPSSPglK^fiSPEFLDHIRKVMLVlDAAV 
TNVEDLSSL£EYLASLGRKHRAVGVKI^SFSTVGESLLYMLEKC 
LGPAFTPATRAAWSQLYGAWQAMSRGWDGB 


5532 


3395 


1402 


SDWMVVGKRKMI lEDBTBFCGEBkLHSVLG£KSVFDVLDGEEMR 
RARTRAN P YEW I RGVFFLNRAAMKMANMDPVFDRMFTOPRDS YG 
KPLVKPRKAKl^YFADVCAGPGGFSKyVI,WRKKHHAKGFGMTLK 
GPNDFKLEDFYSASSELFBPYYGBGGI1X5IX5DITRPENISAFRN 
FVLDNTDRKGVHFIJ<ADGGFSVE^ 

LS rVRTGGHFlCKTFDLFT P FS VGLVYLL YCCFERVCLFKP ITS 
RPANSERYWC KGLKVGIDDVRDYXPAVN I KLNQ LRNTDSDVNL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine , ocysteine, D=Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G=Glycine # 
H=Histidine, I^Isoleucine, K*-Lysine, 
L=Leucine, M=Methionine, N~Asparagixie , 
P-Proline, Q=*Glut amine, R=*Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=DnJcnown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VVPLEVIKGDHEFTOYMIP^NESHCSMIKAL^^ 

SBPROAE I RKECLRLWG IPDQARVAPSSSDPKSKFPELIQ6TB1 

DIPSYKPTLLTSKTtiBKIRPVFDYRCMVSGSEQKFLIGLGKSQI 

YTWDGRQSDRWI KLDLKTEbPRDTI/LSVE IVHELKGEGKAQRKI 

SAIHILDVLVI^TDVREQHra^ 

PIRVKBVYRI»BKMEJttFVKl.EMKI IKGS5GTPXLSYIGRDDRHF 
VPMGLYIVRTVNEPWTMGFSKSPKIOT 

IAP FH1 CYYGRL FWEWGDG IRVHDSQKPQDQDKLSKED VI>SFIQ 
MHRA 


S533 


94 


789 


^ERRAPQPWARCKLVLVGDVQCGKTAMLQVIJVKIXr^PETYVP 
TVFENYTACX^TBKQRVEI>SLWDTSGSPYYDNVRPLCYSDSDAV 
LLCFD I SRPETVDSALKKWRTE I LDYCPSTRVIiL I GCKTDLRTD 
LSTIiMEI^HQKQAPISYEQGCAIAKQLGPEIYLEGSAFTSEKSI 
ilSlFRTASMLCLNKPSPLPOKSPVRSIiSKRLLHLPSllSBLISPT 
FKKKKAKXCBIM 


5534 


3 


605 


LVRGRARAANPGRVGAMIXJLR^VEHFLEQRNLVTEVLGAIiEAK 
TXJVTBCRYIiAAGAVTIjIjSLYI^ IK 
AIBSPSKDDI>TVWLTYWVVYAliFGLAEFFSDIiLIjSWFPFYYVGK 
CAPliI*FCMAPRPWNGAriWI»YQRVVR PLFLRHHGAVDRIMNDLSG 
RALDAAAGITRNVKPSQTPQPKDK 


5535 


1029 


3*2 


^PMDSEARI^SLVEI^in^D^KSDSENEDLKIDCLQESQEL 
NLQKLKNSERI LTBAKQKMREtiTVN I KMKBDL I KEblKTGNOAK 
S VSKQYTLKVTKLEHDAEOAKVBLTETQI^LQEljENKDr^ DVAM 
KVKLQKEFRKKVDAAKLRVXJVIX3KK0^ 

ELEQSVDHMKYQKIQLQRKLQEENRKRKQLDAVIKRDQQKIKVI 
LSYX PAKYNMKC 


5536 


942 


282 


AAATAASLS PRGCPXRTPSSDVS PSRA?PPSAAPLPTGRAQMSP 
SGRI/^TXVGLILPTRGQTIja>TTSSSSADATlMDIQVPTRAP 
DAVYTEIjQPTSPTPTWFADETPQPQTQTQOIiBGTDGPLVTDPET 
HKSTKAAnPTDDTTTLSERPSPSTDVQTDPQTLKP8GFHEDDPF 
FYDEHTLR FGiGfcLVAAVLF ITGI I 1 LTSGKCRQI*SRLCRNHCR 


5537 


3 


2391 


WW » tr\iUK V S ICO ksivf JvivLuC V La X JH K x a vMXitixuiXy L\iKr VAR,I 

PGHPGSWEMGLLTPRDVAVEFSLEEWEHLEPAQKNLYQDVMLEN 
YRNLVSLGLWSKPHLITFLHURKKPWKVKSKETVAIQPDVPSH 
YWKDLLTEHCTBAS FQKVI SRRHGS C3>LENLHIaRKRI9KREECEG 
HNGCYDEKTFKYDQPDBSSVESriPlKXiII*SSCAK3 
PTHSSL1^0X}KEIDIWGKHHIYDKT3VLPP^STLNSYRNVPIG 
EKNYHCNNSEKTI^QSSS PKNHQEEfy FLEKQYKCKE F3EVFLQS 
MHGQBKQEQSYKCNKC^EVCTQSIjKH I QHQTIHIRENSYS YWKY 
D XDLSQSSMLRKQ 1 1 HNEBKPY KCEKCGDSLNHSLELTQHQ IIP 
TBEKP YKWI03C3GKVFNLKCSLYLTKQQQ IDTGEKLYKCKACS KS 








EKPYKrKECGKAFNRSSCLTQHQTTHTGEKLYKCKVCSKSYARS 

SNLIMlIQRVHTGBJ^YroCKECGKVFSRS 3CLTQHRXIHTG 

KOCVCAKPFTCFSNklVHERIHTGEKPYK^^ 

RHRRIinGEKPYKCKACSKSFSDSSGLTVHRRTHTGEKPYTCKK 

CX5KAFSYSSDVI01u^lHTGQRPYKCEECGKA^NYP^YI»TTHQR 

SimJBRPYKC^ECGKAFNSRSYLTTHRPJmTGERPYKOJECGKA 

FSYRSYLTHIPJcSHSGERPYKCEEOGKAFWSP^YLlAHQR^HTR 

BKL 


5538 




161 


HSMWI KIP WGS I PVI MT.T.T.T.TiRT.Tr»T *?f* Juttl SICTCl DP & T POT PTS 

I PGTPGPDGOPGTPGI KGEKGLPGLAGDHGEFGEKGDPGIPGN P 
GKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQKIAFSATRTI 
WVPLRRDQTIRFDHVI TNMNNNYEPRSGKFTCKVPGLYYFTYHA 
SSRGNLC\WLMR(^RERAQKVVTFCX>YAY1JT FQVTTGGMVLKLEQ 
GENVF1X}ATDKNSLIX2MEGANS I PSGFLLFPDMEA 


5539 


38 


1258 


HRGPSGAAAPGCALPRGOJUiF^PRSCF^PQPMARRYDEIiPHYPG 
I VDGPAAIAS FPETVPAVPGP YGPHRPPQPLP PGI4DS DGLKREK 
DEIYGHPLFPLlALVTEK^rATCSPRJX^GAGIXSTPPGGnVCS 
SDSFNEDTAAFAKQVP^E3iPLFSSNPEIJ)NL\rtOAlQVIjRFHIi 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=»Histidine, I=Isoleucine, K=Lysine, 
^Leucine, M=Methionine, N=Asparagine, 
P^Proline, Q=Glut amine, R^Arginine, 
S=Serine, "^Threonine, V=Valine, 
HaTryptophan, Y=Tyrosine. X^Unknown, *~stop 
Codon, /=possible nucleotide deletion, 
Vpossible nucleotide insertion) 








ET^KVHDiO)NFaiRYI TCIiKGKMPIDLVIBDRDGGCREDFEDY 
PASCPSLPDQNKMWIRDHEDSGSVHLGTPGPSSGGIASQSGDNS 
SDQGDGLDTSVASPSSGGEDEDLDQERRRNKKRGIFPKVATNIM 
RAWLFQHLSHPYPSEEQKKQLAQDTGLTILO^HNWFINARRRIV 
QPMIDQSNRTGQGAAFS PBGQPIGGYTBTQPHVAVRPPGS VGMS 
LNI.EGEWHYL 


5540 


148 


1440 


PPr/3AGAGVHARSPHPARRIiPI*TTAGVGGRAPDLLPTPWRQHRG 
PSGAAAPGCALPRGQAI^PRSCRRPQPMARRYDKLPHYPGIVD 
GPAALASFPETVPAVPGPYGPHRPPQPLPPGIiDSDGLKRBKDEI 
YGHPLFPLLALVFEKCEIiATCS PRDGAQAGLGTPPGGDVCSSDS 
Fl^DNTAFAKQVRS^PLFSSNPELDNLMIQAlQVIiRFHIiliELE 
KGKMPltoLVimRDGGCRKDFBDYPASCPSLPDQmTMIRDHBD 
SGSVHLGTPGPS SGGLASQSGDNSSDQGVGIJDT3 YASPSSGGED 
EDMX)EPRRNKKRGI FPKVATNIMRAHLFQHLSHPYPSEEQKKQ 
IAOJWGLTII^Vl^PIliAI^ 
QPIGGYTETEPHVAFRAPAS VGDEFGTR KKEWHYL 


5541 


148 


1440 


P PLGAGAGVHARS PHPARRLPLTTAGVGGRAPDIiLPT PWRQHRG 

PSGAAAPGCAI^RGQAU2GPRSCRRPQPMARiiYDELPHYPGIVD 

GPAAIASFPETVPAVPGPYGPHRPPQPLPPGLDSDGIiKREKDBX 

YGHPLFPLLALVFBKCELATCS PRDGAGAGLGTPPGGDVCSSDS 

F>reDOTAFAKC3VRSERPLFSSl^ELE*niMIO^0 

KGKMP IDLVI EDRDGGCREDFED YPAS CPS LPDONNXW IRDHED 

SGS VHLGTPGPSSGGIiASQSGDNS SDOGVGLDTSVASPSSGGED 

EnijDQF^RRNK^G I FP ECVATN IMRAWIiFQHIjSHP YPS EBQKKQ 

LAQDTGIjTIIXJVIJNWFINAKRRIVQPWIDQSN^ PEG 

QPIGGYTE^PHVAFRAPASVGDEFGTRKEEWHYL 


5542 


148 


1440 


PPLGAGAGVHARSPHPARRLPLTTAGVGGRAPDIjLPTPWRQHRG 
PSGAAAPGCAI^RGO^BGPRSCRRPQPMARRYDEIiPHYPGIVD 
GPAALASFPEWPAVPGPYGPKRPPQPLPPGIjDSDGLKRBKDBI 
YGHPLFPLLALVFEKCEIJVrCS PRDGAGAGLGTPPGGDVCSSDS 
FT^OTAFAKQVRSERPLFSSNPELDNLMIOAIQVLRFHLLELR 








SGSVHIGTPGPSSGGLASQSGDNSSDO<^G1jDTSVASPSSGGKD 
EDU)QEPRRNKKRGI FPK^TATNIMRAWLFQHLSHPYPSEBQKKQ 
l^DTGLTILQVNNWFimRRRIVQPMI^ 
QPIGGYTKTBPHVAFRAPASVGDBFGTRKEEIfHYL 


5543 


2405 


665 


RVfVREQPWPIiRTSEAVKTPALRPFPGPRGVSPFPKPDVJGKSPAP 

KRPFSDSGAFWS PKRRPGVIiEAPRRR PVPAS FRAVP PKPTRVHG 

SSASRDRVLARTMIVADSECRABXKDYlJtPAPGGVGDSGPGEEQ 

HBSRARRGPRGPSAPIPVEWIJ^GAJKr^O^I^t^AU^SGRV 

DNLAWMGLHPDYFTS FWRLHYIJJliHTtX3PlJ\S S WRI nflAIMAA 

ARHOCSYLVGSHMAEFLO/TGGDPEWLLGl^ 

LLABTlP^ITKEHIOJVIiIilCKSEm^ 

SFVFGCGILPEGDADGSPAPQAPTPPSEQSSPPSRDPLNNSGGP 
ESARDVEALMERMQ^JWESLL^^ 

VTPSADILBPS PHPDMtiCFVBDPTFGYRDPTRRGAQAPPTFRAQ 
DYTWEDHGYSLIOJU.YPEGGQIXDE1CFQAAYSLTYNT1AMHSGV 
DTSVUiRAIWy r IHa/FGlRYDDYDYGKVIK)iaiERNIiKVYIKTV 

RA1TRYMT 








5544 


1895 


514 


LGGLX<GRQRLLIJttKni^^ 

PEGRRGEPLRRRASSASVPAVGASAEGTRRDRLGSYSGPTSVSR 
QRVESLRK3CRPLFPHFGLDIGGTLVKLVYFEPKDITABEEEEEV 
ESLKS I RECYLTSNVAYGS TGI RDVRLE LKDLTLCGRKGNLHFIR 
FP^HDMPAPIQMGRDKNFSSLnTVFCATGGQAYKFEQDFLTIGD 
I^LCJOJ)ETiDC^IKGILYIDSVGFNGRS(X^YFENPADSEKCQK 
LPFDLKNPYPLLLVNIGSGVS IliAVYS KDNYKRVTGTSLGGGTF 
FGLCCUjTGCTTFF^ALFJ^RGDSTKVPKLVRDI YGGDYRRFG 
LPGWAVASSFGNMMSKEKREAVS KEDLARATklTITNNIGS IAR 
MCALWENINQWFVGNFLRINTIAMRIJjAYAI^YWS^ 

sehegyfgavgallellkip 
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SBQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(AsAlanine, C=Cysteine f D=*Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H-Hietidine, I^Ieoleucine, K=Lysine f 
L»Leucine # H«Methionine, N-Asparagine, 
P=sProline, Q«Glutamine, R=Arginine, 
S»Serine, T= Threonine, V- Valine, 
W=Tryptophan, Y^Tyrosine, X=*Unknown, *=«Stop 
Codon, /^possible nucleotide deletion, 
\ ^possible nucleotide insertion) 


S545 


S02 


131 


GAMWSAGRGGAAW P^I*LGLLIALLVPGGGiUUQX3ABLVTCGS VL 
RLLNTHHRVRLHStlDX KYGSGSGQ^S VTGVEASDDANS YWRIRG 
SSEQGCPRGSPVRCX^VRLTHVLTGKNLHTHHFPSPLSNNQEV 
SAFGEIX^DDIiDLWTVRCSGgHWERKAAVRFOHVGTSVPIjSV'r 
GBQYGS P IRGQHEVHGMPSANTHNTWKAMEGI F I KPSVEPSAGH 
DSL 


554<T 


1592 


146 


PVPRGG3SS MGQSGRS RHOKRARAQAQtRNLBAYAANPHS FVFT 

RGCTGRNTRQLSLDVRRVMEPLTASRIjQVRK 

LGVTHFLI I*S KTETNVYFKLhKtfjPGGPTIjTTQVKKYS IjVRDWS 

SIiRRHRhmKQOFAHPPLLVLNSFGPRGMHViajMATMFQ^ 

NVHKVNIOTIKRCUjIDYMPDSG^I^FR^ 

KIMBXFPi^RUiDISELIAT^^ 

AVAGRGNMRAQQS AVRLTE IGPRWTLQLI KVQEGVGEGKVMFHS 
FVS KTBEELQAI LBAKBKKLRIiKAQRQAQQAQNVQRKQBQREAH 
RKKfiLBGMKKARVGGSDEKABGIPSRTASLELGEDDDBQBDDDI 
ETxFCQAVGEAPSEDLFPEAKQKRLAKSPGRKRKRWE^RGRGRL 
CX^KFPKTKDKS(^AOARRGPRGASRDGGRGRGRGRPGKRVA 


5547 


1592 


146 ""' 


FVPRGGHSSMGQSGRSRHQKRARAQAQIjRNLEAYAANPHS FVFT 
RGCTGRNXRQI>SI»DVRRVMKPLTASRIjQVR kknslkdcvavagp 
lgvthpi^lsktetnvyfkij^pj^pggptltfqvr^ 
slrrhrmheqqfahp piii#vlnsfgphgmhvklmatmfqnlfps i 

^^^KVNI2T^I KRCIJlilDYNPD SQBLDFRITy S I KWPVGA3RGMK 
KLI^KKFPimSRIiQDISBLLATOAGl^BSRAEPDGDHNlTBl^ 
AVAG^GNMRA^SAVRLTEIGPRmi^LTKVQKGVGEGKVMFHS 
FVS KTEKKI^Ari^KAKEKKLRUCAQROAQQAQNVQRKQEQREAH 
RKKSLEGMKKARVGGSDEEASGI PSRTASIiKLGEDDDEQEDDDI 
EYFCQA^EAPSEDI^PRAKQKRLAKSPGRKRKRWEMDRGRGRI. 
CDQKFPKTKDKSQGAQARRGPR<^RDG<3RGRGRGR 


5548 


1 


2153 


DQ^PPBTIAFrFPP^TMEPI*C^IaVXW 

DSNETTTTSGPPDPGASQPkLAWLLLPLIA^ 

RKQRKAWSTSDKKMPNGILBEQEQQRVMLLSRSPSGPKKYFPI 

PVEHLBEEIRIRSADIX^KQFREEFNSLPSGHIO^TFEIiANKBEN 

REKNRYPNIIxPlTOHSRVIIiSQLDGIPCSDYINASYIDGYKFKNK 

FIAAQGPKQKTVWDFWRMVWEQKSATIVMLTNLKEPJCBBKCHQY 

WPtKX?CimGMIRVCT/EI>CVVI.TO^ 

LVSQLHFTSWPDFGVPFTPIGMLKFLIUCVKTIWVHA^ 

8AGWRTt?rFI VIDAWIAMMIAEXJKVDWBFV3R I RNQRPQMVQ 

TDMQYTF I YQALLEYYIjYGDTBIiIJVSS LEKHLQriTIHGTTTHFPK 

IGLBEKFRKLTNWIMKFJmRTGl^PANMKKARVlQl IPYDFNR 

VTI^MKRGQ^YTDYINASFIDGYRQKDYFIAIXJGPLAHTVKDFW 

RM I WEWTCSHTI VMI .TKVQRREQDKCYQYWPTEGSVTHQE ITI EI 

KNDTLSEAISIRDFLVTliNQPOARQEEQVRVVRQFHPHGWPEIG 

IPAEGKGMI0LIAAVQKG^G^X3W1PITVH(^AGAGRTGTFIAL 

SNILERVKAEGLLD VFQAVKSLRLQR PHMVQTLEQ YBFCYKVVQ 

DPIDI FSDYANFK 


5549 


915 


256 


FEATGGKRIAFKMAGTAR^ 

CLARGSAGIKGLGRVFRIMDDI»INP^LDFKBFMKGLNDYAVVME 
KF^VEELFOJRFDKDGNGTIDFNEFILTLRPPMSRARKEVIMQAF 
RKLDKTGDGVIT I EDLREVYNAKHHPKYQNGBWS E BQVFRKFLD 
NFDSPYDKDGLVTPBBPMNYYAGV3AS IDTDVYPI IMMRTAWKL 


5550 


2364 


1210 


RKRKVFl»KMRRIiKRKICTIiSLVKBI»D^ 

SLIAFTTMALIjTIMEFSVYQDTWMKYKY^VDKDFSSKLRINIDI 
TVA^CQYVGADVI^T^TMVASAIX3I>VYEPTVFDT^PCi2KEHQ 
RMLQIi I QSRLQBEHSLQDVI FKSAFKS TSTALP PREDDSS OS PN 
ACRlHGHLmntVAGNPHITVGKAIPHPRGHAHIAALVm^ 
FSHRlDHLSPGBLVPAI E^LDGTIWIAIDHNQMFQYFITVVPT 
KlilfTYKISADTHQFSVTBRERI INHAAGSHGVSGIFMKYDLSSL 
MVTVTEFJlMPFHQFFVPJiCGIVGGlPSTTGMLHGIGKFIVEI IC 
CRFRLGSYXPWSVPEEJOrn>NRliPIiLENNTH 


5551 


211 


1700 


MORDHTWDYKBSCPSVSIPSSDBHP^KIOaiPTVYKVLVSVGRSB 
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SEQ 
ID 
NO; 


Preciicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C= Cysteine, D^Aspartic Acid, 3= 
Glutamic Acid, F=phenylalanine, G=*Glycine, j 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, rt-Methionine, N^Asparagine, 
P^ProIine, Q-Glutamine, R=Arginine, 
S»Serine, T»Threonine, V^Valine, 
W«=Tryptopban, Y*-Tyrosine, X^Onknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide Insertion) 








WFVFRRYABFDKLYNTLKKQFPAMALKIPAKRIFGDNFDPDFIK 

QRRAGt^PIQNLVRYPELYNHPDVRAFLQMDSPKHQSDPSEDE 

DERSSQKLHSTSQN INLGPSGHPHAKPTDFDFLKVIGKGS FGKV 

LLAKRKl»DGKPYAVTCVLQKKJ[VliEIRKEQKH 

PFLVGIJIYSF^TTEKLYFVIJDFVNGGBLPPHIjQRERSFPKIIRAR 

FYAAEIASAW3YU?S I KXVYRDLKPEN 1 LLDSVGHVVLTDFGLC 

KEGIAISDTTTTFWTPKYTiAPEVIRKQP 

EMLYGLPP FYCRDVAEMYDNI LHKPLSLRPGVSLTAWSILEELL 

EKDRQNRLGAKEDFLE IQNHPPFBSLSWADLVQKKIPPPPNPNV 

AGPDDIRNFDTAFTEETVPYS VCVSSDYS IVNASVLKADDAFVG 

FS YAPP SEDLFL 


S552 


2740 


930 


IXJPAAGAAMGKKHKKHKAEWRSSYEDYADITC^ 

SSVTEI^GSGHDSSY^DDRSDHESlERHlffilQOaatiOtKSEKEKHL 

DDEBRilKRJCEEKKRKRRREHCm'BGEADDFDPGKKVKVEPPPDR 

PVRAC3nX}PABNE$TPIQQLLEHFLRQI^^ 

IAPGYSMI I KHPMDFGTOKDKIVANBYlCSVTEFKAD^ 

MTYNRPDTVYYKLAKKI UIAG FKMMSKQAALLGNBDTAVEBPVP 

EWPVQVBTAKKSKKPSREVI SCMFEPEGNACSLTDSTAKEHVi 

AIiVEHAADEARDRINRFLPGGKMGYIiKRWG^ 

DADKBETHP VDLSS I>S S KI JjPG FTTDG FKDERRNKVTPLSSATT 

ALSMQNNSVFGDLKSDEMEIjL YS AYGDBTG VQC3\IiSIjQE FVKDA 

GSYSKKVVDM^LDQITGGDHSRTfcFQL^ 

DTJUJDSSSSVIiEFWSMKSYPDVSVDISMLSSLGKVKKELDPDDS 

H3^,DFrrTKLI^DLHEAQAERGCSRPSSNl>SSl>SNASERD0HHL 

GSPSRISVGBQPDVTHDPYBFLQSPEPAASAKX 


5553 


74 


1095 


LGREAVYLVS RMDGPVAEHAKQE PEHWTPI»I#ESWAl»SQVAGMP 
VFLKCENVQPSGSFKIRGIGHPCQEMAKKGCRHLVCSSGGNAGI 
AAAYAARKLGIPATIVLPESTSLQVVQRLQGEGAEVQLTGKVWD 
EANLRAQELAKRt)GWBNVPP FDH PI*I WKGHASLVQELKAVLRTP 
FGALVIAVGGGGUJtf3VVraU^VGWQ 

A 1 TAGKLVTL PDI TS VAKS LGAKTVAARALBCMQVCKIHSEVVE 

IYfEAVSAVOSU^IffiRlfoVBPAOGAAl^^ 

PPSLTSVWIVCX3GNNINSRELQALKTHLGQV 


5554 


166 


2319 


CSGR1X3GRGSI*RPAENVC^TCKLSGAETRGLLCPAIJ?TWIMKVT> 

GRSFFWVLFPVLPWAVQAVEHEBVAOJRVIKIMGRGVAAMQSRQ 

WVRDS CRKLSGLLKUKNAVLNKL KTAIGAVR KD VGLSDEEKLFQ 

VHTFEI FXiKKI^ESENSWQAVYGLQRAlJQGDYKDVVmKESSR 

QRI'EAIiREAAI KEETEYMEI JjAAEKHOVEAI)KNMQHC!NQSLSML 

DEII£DVRKAADRLEEBIEEHAPI)DNKSVKGVNFEftVLRVEEBE 

AKSKQNITKREVBPDLGjbSMIilDSCJ^QYTLTKPRDSTIPRADH 

HFIKDlVTIGMbSLPCGWLCTAIGLPTMFGYI ICGVUjGPSGLN 

SIKSIVQVETI^KFGWFTLFLVGLEFSPKaJU^ 

YtflXIiMIAFGLLWGIflJ^IKPl^ 

GSARGDKESDIDYSTVIiLG^VT^ 

ASSS I WEVLR ILVI,TGQI LFSLAAVFLIi CLVI KKYLIG P YYRK 
LHMBSKGJ^lLILGlSAFIFt^TVTELI^^ 

vs3cgpwtre iats iepirdflaivffasiglhvfptfvayel 
tvlvfltlsvvvmkfllaalvlsw 

33PSFWiGSRARRAGVlSRBVYLLILSV^^ 
TR CVPRPERRSSL 


55S5 


212 


1425 


LSUi'mBTPAPPRCEAASQGRVG^PJWAAAEEAVP^VWNimiDR 

/wiirh nAitT f>tir*s~*T t r T irr T/*ntmh^nnoinrYT nTTT^Y* V\ CTifT\Tir 

GTMftPQ3Q.^TFCfl^LLItYIiIGAv^ 

F^YRKXALQLHPDPJ^DDPQAQEKFQDLGAAYEVLSDSEKJiKQY 

DTYGEEGLKDGHQSSHGDI FSHFFGBPGFMFGGTPRQQDRKIPR 

GSDIIVDLEVTLEBVYTiGHFVEVVRinCPVA^ 

MRXTQLGPGRFG^TQEVGCDBCPPTVKLVIJEERT^ 

GMRYPF IGKGEPmnX5KPGDLRFRIKVVKHPI FERRGDDLYTNV 

TISLVESLVGFBMDITHLDGHKVHISRDKITRPGAKLWKKBEGL 

PKFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQK 

VYHGLQGY 


5556 


5B35 


3346 


RTRGXSKNCVPMEFEEYIiIi^FQGTFYLIiQKITKDNNAHrVKSR 
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ID 

NO: 


beginning 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


raux.u-p auiu ocvjiucui' i,\Jiii_ti Aiij-ny si^ual ucpuXQc 

(A^ Alanine, c=Cysteine, l*=Aspartic Acid, B=» 
Glutamic Acid, r«Phenylalanine, G=Glycine, 
H=Histidine, Idsoleucine, K=Lysine, 
L=?Leucine r M^Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, RT^Arginine, 
S«*Serine , T=Threonine , V^Val ine , 
W=Tryptophan, Y=Tyrosine, X=Unknown , *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








LKELDESYIEKPTDFUa^SVHLRRIBSY$QFPV\mFLTI^iFK 

YTFHQPTIIKGYFSCIJJIWTLFLDyLTSKIKSRLGDKEAVLNRYE 

DALVLI>LTEVLNRIQF11YNQAQLEELDDETLDDD0QTEWQRYLR 

QSLBWAKVMEIiLPTHAFSTLFFVLQDNLEVYL^ 

GHRI^ITAENDCRRIiHCSI^DLSSLI^VGRIiAEYPIGDVFAAR 

FNDALTVVBRLVKVTLYGSQlKIjYNICTAVPSVIiKPDI.ro 

SIJUU^YSHWrAQYCSEVHRQNTQQFVTLISTTMDAITPLIST 

KVQDKLIJjSACHIiLVS LATTVRPVPLIS I PAVQKVFNRI TDASA 

IjRLVDKAQVLVCRALSN I LLLPWPNLPENEQQwPVRS inhasli 

SALSRDYRNLKPSAVAPQRKMPLDDTKLI IHQTLSVLEDIVENI 

SGESlTCSRQICYQSLQESVOVSLALFPAPIHOSDVTDEMIiSPFL 

TLFRGI^VQMGVPFTKQIIG/rPLNMFTREQIjAESII^HEGSTtX^ 

VVEKFLKIIKJVVVQBPGQVFKPFIiPSIIAIiC^ 

PDVKAELFEIiljPRTLHHNWRYFFKSTVLASVQ^ 

QFSAIMQAFGQSFIjOPDIHI^KQNLFYIJCTLOTKQKLYHKKTFR 








AFLPEFLTS OX^mANQKSVJ^l^FKP^VRPJERGRAKRRAElW 
RKPGTCAARRGHIEASGRGLCPPCSLAAAHEMPADLVL 


5557 


1712 


491 


VXIrfdAGLRDKDMWl P VVGLPRRI JtLS ALAGAGRFC1 LGS EAATR 

KHLPARNH CGLS DS S PQLtyPE PDFRN PPR KAS KASLDFJCR YVTD 

RRLABTLAQ I YLGKPS RP PHLLLECNPGPG I LTQALLKAG AKW 

ALKSDKTFIPHLKSLGKNLDGKLRVIHCDFPXIJJPRSGGVIKPP 

AhtSSROT,P}OJI/5IKAVpI^ADIPLKWGWPPSRGHKRALWKrAY 

DLYSCTSIYKFGRIEVNMPIGEKEFQKLMADPGNPDLYHVLSVI 

WQIACEIKVT^MBPWSSFDIYTRKGPI^PKRRBLU>QI^KLY 

LlC^rPRQNI^TK^TPMNYNIFFHLLKHCFGR 

LTPLDARDI U1QIGKQEDHKVVNMHPQDFKTLFETIBRS KDCAY 

KWLYDETLEDR 


5558 

- 


1509 


96 

- 


RAGCTH PQV PADLGAPAE PRR PQKTCVCLLQPQPGGQRG PTTM I 
TGVESMRLl^PVGVLTSLAYCLHQRRUALAELQEADGQCPVDRS 
liKlJCMVQVVPRHGARSPIiraPLPLEEQVEWNPQLLEVPPQTQFD 
YT^nWrjAGGPKPYSPYDSQYHETTLKGGMFAGQLTKVGMGX^MFA 
LGERLRKNY^DIPFI^PTFl^EVFIRSTHIFTlNLESTElCLIiA 
GLFQCQKEGPI I IHTDEADSEVLYPNYQ3CWSLRQRTRGRRQTA 
SI^PGISEDLXKVKDRMGIDSSDKVDPFILLDNVAAEOAHNLPS 
CPMLKRFARMIEQRAVDTSLYILPKEDRBSL0>1AVGPFLHILES 
NLLKAM3<^TAPDKIRKLYLYAAHDV^ 

FAVDLTMELYQHLES KBWFVQLYYIIGKEQVPPGGPDGT>CPLDMP 
LNAMSVYTLS PEKYHALCSQTQVMSVGNRB 


5559 


ISO 


1983 


PLAATAHFAKMSRVAKYRRQV3BDPDI DSLLKTLS PEEMEELEK 
ELDVVI>PIX3SVPVGl^GJWrEKQSTGVY2ffi 
LMQREMSMDE SKQVETKTnAKWGBERGRDASKKALGPRRDSDLG 
KEP KRGGLKKS F5RDRDEAGGKSGEKP KEEKI IRG I DKGRVRAA 








VKRGTGNTiJT ttXDDBKVKKNEPLHB KEAKDDS KTKTPKKQTPSG 

PTKPS KGPAKVEBEAAPS I FDBPLERVKNNDPBMTEVNVNNSDC 

ITireiLVRFTEALEFNTVVKJ^AIANTR^ 

KTITSLmJJSNHlTCKGlLAJCF^ 

GKTEMKIAKI^KKNTTIiLKljGmFEliAGPRMTVTNL^ 

RQKRLQEQROAQBAKGBKJCDLLEVPKAGAVAKGSPKPSPQPSPK 

PSPKNSPKKGGAPAAPPPPPPPIAPPLIMENLKNSLSPATQRKM 

GBK^PAQEKNSPlXJLIAArRSSNLKQLKKVEVPKLLQ 


5560 


9 


921 


SSVVEFSALSVSMACLSPSQLQKFQQDGFLVLKGPLSAEBCVAM 
QQR1GB IVABMDVPLHCRTEFSTQEEEQLRAQGSTDYFLSSGDK 
IREFFEKGVEDEKGNFLVPPEKSINKIGHALHAHDPVFKSITHS 
FKVQTLWtSIjGLQMPVVVOSM YI FKQPH FGGEVS PHQDAS FLYT 
EPl^RVI/JVWIAVEIWPLENGCLWFI PGSHTSGVSRRMVRAPVG 
SAPGTSFIiGSBPARDNSLFVPTPVQRGALVLIJIGEVVHKSKQNL 
SDRSRQAYTPHLMEASGTTWSPENWLQPTAELPPPQLYT 


5561 


2175 


1775 


CYFIFQFFSSPYPGLHPHQTPAPLPNPGLYPPPVSMSPGQPPPQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end 
1 nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A= Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine f 
H—Histidine, I=»Isoleucine, X^Lyeine, 
L^Leucine, M=Methionine, U«Asparagine , 
P~Proline, 0=Glut amine, R=Arginine, 
S=Serine, ^Threonine, V=Valine, 
WeTryptophan, Y^Tyrosine, X-^unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
N^possible nucleotide insertion) 








QLLAPTY PS APGVKNFGNPSYPYAPGALPPPPPPHIiY PNTQAPS 
QVYGGVTY YNPAQQQVQPKPSPPRRTPQPVTI KPPPPBWSRGS 
£ 


5562 


342 


1385 


SSG KNDMAAAGAAGL VRGL KAG VI»S QAD YLNLVQ CKTI^ED LKUrf 
LG^TDYGN EliAJfEASPLWS VIDDRLKEKMVVEFI^ 
l^FLDFITYSYMIDNVILLITGTLHQRSIAELVPKCHPLGSFE 
OMEAVNXAO/TPABL YNAILVDTPLAA I 
IRNTLYKAYIiESFYKFCnjjGGTT^ 

INS PGIT^KEDRAKLPPHCGRLYPEGI^LARADDYEQVKNVA 

DYYPEYKIAFEGAGSNPGDKTLHDRFFEHEVKl^^ 

GVFYAFVKLKBQBCRNIVWIABCIAQRHRAKIDNYIPIF 


5563 


342 


1385 


SSGKm»lAAAGAAGLVPXajCAGVJ^<^ 

UJSlTJYGNPLAMKASPLTVSVIDDRI.KEKMVVK FRHMRNICAYBP 

liASPLDFITYSYMlDNVILLITGTIiHQRSIABLVPKaiPLGSPB 

QMKAWIAG^PABLYNAILVirrPIAAPFQDCISBQDUDHWIEI 

IRNTLYKAYLESFYKFXTTIJjGGTTADAMCPIIiEF^ IT 

INSFGTririSK^RAKLFPHOGRLYPBGIuAQI^RADDYEQVKNVA 

DYYPKYKLLFEGAGSNPGDICITiEDRFFBHEVKtNlQlA^ 

GVFYAFVKLKKQECWIWIAECIAQRHRAKIDfnflPIF 


5564 


3 


914 


RVRRDKRAVWTARGRRR CGDSMSGGWWAQV(^WRTGAXGI«ALIxIi 
IJiGlXLGI^AAASPLSTPTSAQAAGPSSGSCP PTKFQCRTSGLC 
VPLTWRCDRDrOJCSDGSDEBBCRIBPCTQKGO^PPPPGLPCPCT 
GVSDCSGGTDKKLRNCSRIJVCLAGEIjRCTLSDDCI pltwrcdgh 
PDCPDSSDELGCGTNEILPBGDATTT^PPVTI^ 
GPFVTLESVPSVGNATSSSAGDQSGSPTAYGVI AAAAVLSAS LV 
TATI,LLLSWLRAQERJLRPLGLLVAMKESI>LLSEQKTSLP 


5565 


993 


138 


RWNSPNPARAGS ISRPQRAPGSVSAVAMTAAVFFGCAFIAFGPA 
LALYVFTI ATB PLRIIPLIAGAFFWI>VSIit>ISSLVWFMARVI ID 
NKDGPTQKYLLI FGAFVSVY IQEMFRFAYYKLLKKASEGLKS IN 
PGETAPSMRLLAYVSGLGFGIMSGVFS FVNTLSDSLGPGTVGIH 
GDSPQFPLYSAFMTLVI ILLHVFWGIVFFDGCEKKKWGILLIVL 
LTHL^V^AQrrFISSYYGINIiASAFZILVLMGTWAFIJU^ 
LKLCLIiCQDKNFIJ»YNQRSR 


5566 


2043 


1232 


SHIQHHGRGAQAP VKMVS WM I SRAWLVFGMLYPAY YS YKAVKT 
KNVKEYVRWWMYWIVFALYWIErrVADQ 

IWI^PYTKGASLIYRKFLHPLI^SKJBP^IDDYIVCAKFJJGYET 
MVNFGRQGLNIAATAAVTAAVKS(X2A1 TERLRSFSMHDLTTIQG 
DEPVGQRPYQPLPEAKKKSKPAPSESAGYGIPIiKDGDEKTDEEA 
BGPYSDNEMLTH KG PRRSQSMKSVKTCKGRKEVR YGSLKYKVKK 
RPQVYP 


5567 


1554 


233 


KFI^GVSPDIJ^EJDGLTAUIQCXriDDFREMVQOI^EAGANINA 
CDSKCWTPIJlAAATCGHIin»VELLIASGANr.lAVNTr)GNMPyDL 
CDDEG^PIJXS^ETAMADR^ 

AGADIJIAPIiDHGATLLHVAAANGFSEAAALU^KHRASI^AKDQD 
G WEPLHAAAYWGQVPLVEIJjVAHGADLMAKS lmdbtpldvcgdb 
EVRAKXiI*ELKHKHDALLRAQS RQRSIJjRRR tssagsrgkwrrv 
SLTQRTDLYRKQHAQEAI VWQQP P PTS PEP PEDNDDRQTGAELR 
PP PPEEDNPEVVRPHNGRVGGSPVRHLYSKRLDRSVS YQLSPLD 
SrrPHTLVHDKAHRTIADLKRQRAAAKLQRPPPEGPESPBTABP 
GLPGDTVTPQPIXX^FRAGGDPPLIja^TAPAVBAP 


5566 


X A34, 


587 


AKDRQPAS RRGAGTTAAMAAS GPGCRSWCLCPEVPS ATFFTALIi ~ 
5LtiVS(^RI»FLIjQQPIAPSGLTLKSKALRNWQVYRL FVYE 
NPISLLCGAI IIWRFAG^FERTVGTVRHCFFTVIFAIFSAIIFL 
S FEAVSS LS KLGEVEDARGOTPVAFA^1U?VTTVRSRMRRALVFG 
MVVPSVIiVpWLLLGASWLIPO/rSFIiS^ 

IDLSERVALIO^DGTFPFSLMRRISVFKYVSGSSAERRAAQSRKI» 
NPVPGS YPTQSCHPHLS PSHPVSQTQHASGQKIiASWPSCTPGHM 
PTLPPYQPASGIXrxVONHFGPHPTSSSVYPASAGTSLGIQPPTP 
VWSPGTVYSGAM5TPGAAGSKESSRVPMP 


5569 


2 


835 


QTPCPIAWtR^yKSEUISVPGQKPPTCSSFSGMDVGPSSLPHLG 



344 



WO 01/53312 



PCT7US00/34263 



SBQ 

x ? 

NO: 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C^cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H«Histidine, I=Iooleucine, K=*Lysine, 
I>»Leucine, M ^Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R-Arginine , 
S=Serlne, T=Threonine, v=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LKLLLLIjIit^LPLRGQANTGC YGI P 

KGEPGIPAIPGIRGPKGQKGEPGLPGHPGKKGPMGPPGMPGVPG 
PMG IPGEPGBRGRYKQKFQSVFTVTRQTHQPPAPNShlRFNAVL 
TWPQGDYDTSTGKFTCKVPGLTCFVYH^ 
VVTFCGHTSKTNQVNSGGVIiLRLQVGEE^ 
SDSVPSGFUiFPD 


5570 


264 


946 


RDRRDRGGVATSTEEPARPRAPQSRGPGPVSQTGRGRERGGGDT 
MSSPSPGKRRMDTDWKIi lESKHBVTILGGLHEPVVKFYGPQGT 
PYEGGWKVRVDLPDKYPFXSPSIGFMNKIFHPNIDBASGTVCL 
D VINO/TWTAL YDLTNTFES FL PQIiLAYPNP I DPLNGDAAAM YTiH 
RPEBYKQKIKBYIQKYATEKAI^KEQEBGTGDSSSESSMSDFSED 
EAQDNEL 


5571 


264 


946 


RDRRDRGGVATSTHEPARPRAPQSRGPGPVSQTGRGRERGGGZXr 

MSSPSPGIO«lMDTDVVKLIESKHBVTILGGIiHEPVVIO/YGPC<5T 

PYEGGVNKVRVDLPDKYPPKSPS1GFMNKI FHPNIDEASGTVCL 

DVIWOTVrrAI>YDI>TNIPESFI>PQLIiAYPNPIDPLNGDAAA^ 

RPBBYKQKIKBYIQKXATBEALKEQBKGnSDSSSBSSMSDPSED 

EAQDMEL 


5572 


2802 


2085 


RTDYRTGIPGRRFRVMAAGDGDVKI^TLGSGSESSNDGGSESPG 

DAGAAAEGGGWAAAALALLTGGGBMLIjN^ 

R WGREOLGAGAGAGEES PATSLPRHKKRDFShBQLRQYDQS RNP 

RILLAVNGKVFDVTKGS KFYGPAGPYGI PAGRnASRGliATFCLD 

KDALRDKYDDLSDLNAVOMESVKEMEMQFKEKYDYVGRIiLKPtJK 

EPSBYTDEEDTKBHNKQD 


5573 


' 2*62 


219 


VPARTPNAEI)QGPEARAATATPCQSGGRERAGEAAEIX3VKMAAF 
SRMGVMPE3CAQAVEEMDWLLPTDXQAES iPLIIXSGGDVLMAAET 
GSGKTGAFS I PVIQI VYBTLKDQQEGKKGKTTIKTGASVIiNKWQ 
MNPYDRGSAFAIGSDGLCCQSREVKEWHGCRATKaiiMKGKHYYE 
VSCKDG^LCRVGWSTMQASW)I«GTDKFGFGFGGTGKKSH^ 
NYGEEFTMHDT IGCYX^I DKGHVKFSKNGKDLGIiAFEIPPHMKNf 
OALFPACVLKNABLKFNFGEEEFKFPPKDGFVALS kapdgyivk 
SQHSGNAQVTQTXFLPNAPKAI*IVEPSRB1AB0TLNNIK0PKKY 
IDNPKLRELLI IGGVAARDQLSVLHNGVDIWGTPGRLDDLVST 
GKLMjSQVRFLVIiDEATCIiLSQGYSDFINRMHNQIPQ 
LQVIVCSATLHS FDVKKI*S EKI MHFPTWVDLKGBDSVPDTVHHV 
WPVNPXTDRLWERLGKSH I rtddvhakdntrpgans pemws ra 
I KILKGE YAVRAI KEHKha)OAI IFC^TKIDCCNLEQYFIQQGGG 
PDKKGHQFSCVCXHGDRKPHFJ^KCNLERFKKGDV^ 
RGIDlHGVP^VINVTIiPDRKQNYVHRIGRVGRAERMGLAI SLVA 
TEKBKVWYHVCSSRGKGCYNTRIJCFJ)GGCriWYN^ 
HUiCTISQVEPDIKVPWKFlXSKVTYGQKRAAGGGSYKGHVDIL 
APTVQEIiAALEKEAQTSFLHLGYLPNQLFRTF 


5574 


1731 


952 


NEGLEVFKEQBLQPEDKGAV PBDASTERS AMAS LGLQLVG Y 1LG 
IXGLIX3TLVAMIJ*PSWKn , $SYVGASrVTAVGFSKGLWMECATHS 
rciTQGDI YSTiLGliPADIQAACtfUWVTSSAISSliACI IS WGM 

rctvfcqesrakdrvavaggvffiijggllgfipvawnlhgilrd 

FYSPLVPDSMKFEIGEALYLGIISSLFSL1AGIILCFSCSCQRN 
RSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 


5575 


456 


766 


LLWAIiPCPPPTAAAVLL3 S TGLMELliB KMLALTLAKADSPRTAL 
LCSAWLLTASFSAQ^HKGSLQKDPLLSO^CV^ 


5576 


249 


2146 


RS>OU>WFWP^RLLRRR>IMPLRlJ\MV^ 
EKATEKPWUKSLVSRKDHVLD^ 

QQTLPS INOjSCLPGFYTPAELKPFWBRPPQDPNAPGADGKAFQK 
SKHTPI^TQBXBEG YKKHCFNAFASDRI SLQRS LGPDTRPPBCV 
DQKFRRCPPLATTS VI I VFHNEAWSTLLRTVYS VLHTTPAII JJC 
EIILVDDASTEF^KEKI^QYVKQLOyVRVVRQEFJlKGLITARL 
LGASVAO^VLTPLDAHCECFHGWLEPLI^IABDKTVVVSPDI 
VTIDLNTFEFAKPVQRGBVHSRGNFDWSLT^ 

kdbtypiksptfagglfsisksyfehigtydnqmeiwggbnvem 
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residue of 
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Predicted end 
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location 
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residue of 
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sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D-Aspartic Acid, B« 
Glutamic Acid, ^Phenylalanine, G^Glyeine, 
H=Histidine, I=Isoleucine, K=Lysine, 
I»=Leucine, M=Methionine, NsAsparagine, 
P-Proline, Q=Giutamine, R=^Arginine, 
S^Serine, T— Threonine, V»Valine, 
W-Tryptophan, Y»»Tyxo-3ine, X^Unknovrn , *=Stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 








SFRVWQCGGQLSl I PCS WGHVFRTKSPHTFPKGTSVIARNQVR 
lAE^^SYKKIFYRPJJLQAAKMAQKKSFGDISERLQLRBQLHC 
HNFSWYIJINVYPEM1WIXLTPTFW 

GKPLIMySCHGIX3GNQYFEYTTQRDLRHNIAKQLCLHVSKGALG 
LGSCHFTGKNSQVPXDEEWEIiAQDQLIRNSGSGTCLTSQDKKPA 
MAPCNP3DPHQLWLFV 


5577 


3 


1275 


T>xrcnr > <^n -, i7Tcnrur"T tjui/t bit t\t mroccun^r^m vt tt t tuft y n 

YSI^LNDIjHVSPPELTVHVGDSALMGCV^ 

SPGEHAKDBYVIiYYYSNLSVPXGRF^RVHL^DIU^TOSLli 

QDVQEADQQTYIC^IRLIQ^SQVPKKAVVI, 

GGLIO^CVFQSTBVKHVTKVEWIFSGRRAKEBIVFRYYHKLRM 

SVEYSQS WGHFGfRVNIiVGD I FRNDGS I MLQGVRBS DGGNYTCS 

— »TT f »Uf TM?Vlfl>TTTy TT1FC t>l#l> Ptl*«l»1 t MmK * T T\ V\T "\4 f /lOkir\r irT TTt 

- HLGNLVr vJjHVS FBEPRTLVlFAALJ^LVlAaGNQL V 1 IV 
GIVCATILLLPVIiI IilVKKTCGlTCSSWSTVLVKNTKKTNPBl K 
EKPCHFBRCEGEKHIYSPIIVREVIE^KEPSEaCSKATYMTMEPV 
WPSLRSDRNNSLEKKSGGGMPKTQQAF 


5578 


3 


783 


AVBCMAS PGACRAP PELPERNCGYREVE YWDQRYQGAADSAPYD 
WTODFSSFRAI^PELRPFJ)RILVLGCGNSALSYBLFLGGFPW 
TS VDYSS VVYAAMQARYAHVPQLRWSTMt) VRKIiDFP SAS FDWL 
EKGTLDAI»IAGERDP1>iTVSSBGVHT^ 

SMTSAAPHFRTRHyAO^YYGWSLRHATYGSGFHFHLYLMHKGGK 
LSVAQUVLGAQILSPPRPPTSPCFIjQDSDHEDPLSAIQL 


5579 


3 


1540 


RNSGLARGASAIARHGGGLAGGVGWIXX5AC 

OtAI^AIATC^RQLSGYVPCRFHHCAPRRaRR 

REDRVI»S LQDKSDDI*TCKSQRU41iQVGIj I YPASPGCYHLLP YTV 

RAMEKIjVRVT DQEMQAIGGQKVNMPSLS P ABLWQATNRWDIiMGK 

BLIiKhRDRHGKSYCI^PTHEEAITALIASQKKTiS YKQLP FLLYQ 

VTRKFRDEPRPRFGLLRGRKF YMKDHYTFDS S PEAAQQTYSIjVC 

DAYCSLPTJKIASI^FVKVOaD^ 

ICPRCSFSANMETUDLSO^WCPACQGPLTKTKGIEVGIITFYIiGT 

KYSSIFNAQFTNVCGKPTIABKGCTGU3^^ 

CVRWPS LLAPYQACLI PPKKGSKjeQAASEL I GQLYDHTTEAVPQ 

LHGKVXilJ^nRTHLTIGNRJUKDANKFGYPFVI IAGKRAUSDPAHF 

EVWCQNTGBVAFLTKIX3VMDLLTPVQTV 


5580 


1681 


450 


ADAGTRC I PG FVVPSGAG YSAPAQRGRRS SGrMrAAAAPGLTAP 

VmiJX^C^RAGELGMAVPAAAMGPSAI^SGPGSMAPWCSVSS 

GPSRYVLGI^ELFRGIISKTREFLAHSAKVHSVAHSCDGRRIASG 

SPDKTASVFIJ^KDRLVKEIWYRGHGDSVDQI^^ 

ASGDKT IR I WDVRTTKCIATVNTKGBNINI CWS PDGQTIAVGNK 

DDWTF I DAKTHRSKAEBQFKFEWK I SWNNDNNMFFLTWCTJGC 

INILS YPKLKPVQS INAHPSNCIC I KFDPMG KYFATGSADALVS 

CTETGDKLWEVQCKSPl'FTVAWIIPKPJLiAFACDDKIX^KYDSSR 
EAGTVKkFGLPNDS 


5581 


64 


947 


GGGSGPRAP SATLLDTGKSVAAVASGEDKG IAASAAAAAVFACS 
CSPDPQSSThWPVYSPVQpG^YGNPKNMAYTGYPl'AYPAAAPA 
YNPSLYPTl^PSYAPEFXJFIiHSAYATI^KOAWPONSSSCGTSG 
TFHLPVIXrGTEimTYQASSAAFRYTAGTPYKVPPTOSNTAPPP^ 
SPSPNPYG/rAMYPrRSAYPG^NTjYACKJAYYTQPVYAAQPHVIHH 
TTWQPNS IPSAI YPAPVAAPRTjS^ VAMGMVAGTTMAMSAGTLL 
TTPQHTAIGAHPVSMPTYRAQGTPAYSYVPPHW 


5582 


5775 


2739 


IITNNlWVIlPLVIJ\YHI>SGSAQARGER3PA£RJiMERQKRKAI)I 
BKGI^FIQSTLPLKQF^YEAFIiLKLVQI^FAEGNDLFR^KDYKXJ 
ALVQ YMEGLNVAD YAASDQ VAL PRBLLCKLHVNRAAC Y FTMGL Y 
EKALEDS KKALGLDSBS IRALFRJQ\RALNEI/jRHICEAYE C£S RC 
SLALPHDRSVTQLGQEXAQKIjGLRVRKAYI^ 
TAAGVADG^SNGIXSSlJroiETIX^n^PRGSPALLPSTPTMPLF 
PHVLDIJ^IJDSSRTLPSTDSLDDFSIXSDVFGPRLDTLLDSI^L 
VQGGIiSGSGVPS EJjPQLI PVFPGGTPLLPPVVGGS I PVSSPTiPP 
ASFGLVMDPSKKLAASVLDftLDPPCT 
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rauino acta segtuejic. containiay i>J-yiiux pt;^Ji,-itu£ 

CA=Alanine f C=Cysteine, tfc=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glyr:ine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L=Leucine„ M=Methionine, N^Asparagine, 
p=Proline, Q=«Glutamine, Rsd&xgiriine, 
S^Serine , T^Thr eonine , V^Val ine , 
W*»Tryptophan, Y-Tyrosine, X=Unknovm, *-Stop 
Codon, /=possible nucleotide deletion, 
\«=possible nucleotide insertion) 








DS FGSTRGSLDKPDSFHEBTNSQDHRP PSGAQKPAPS PEPCMPN 
TALLIKNPIiAATHRFKGACQLCYPKIG^ 

RDILDGRliRSSEDQTWKl?. IRPR PTKTSPVGSYYLCKDMINKQDC 
KYGDNCTFAYHQEE XDVWTEKR KGTLNRDLIjFDPLGGVKRGS JYT 

t»vt r vour\r»Ti7 r Pt?T.r , c»T r^crvc: vox? ttc? i7Tlf r irrvi'Dc:\7r , Q>vTT jvivir 
iAJSJUulVnnyuiX r 1 r iA^Jh L U? IVJr K 1 io dmj 1 lUJof o V UoNlj/mA. 

HSPYNNKCLVHIVRSTSIiKySKIRQFQBHFQFDVCRHBVRYGCL 
RKDSCHFAHSPIELKVWM^YSGMTHBDIVQESKKYWCKJMKAH 
AGXASSSMG^RTHGPSTFlXt^KFVCGQ^ 
YCSAKARHCWT KERRVLLVMS KAKRKWVSVRPLPS IRNFPQQYD 
LCIHAQNGRKCQYVGMCSFAHSPEBRDMOTFMKSJWCl LDMQQTY 
DMVJ LKKHN PGKPGEGTP I S SRBGBXQIQMPTDYAD 1 MMGYHCWL 
CGKNSNSKKQW0QHIQSRKHKBKVFTSDSDA5GWAFRFPMGEFR 
I.CDRl^KGKACPraDKC^CAIIGQBBI^WLDRRKVTjKOKXJauvR 
KDMLLCPIU>DDFGKYNFLLQBI)GDIJUaATP 


5583 


3 


1265 


SSGCRQGRPGRSDRPRPPPRRHKMVKKU'R ¥ YPII/3VKPSASPEE 
I KKAYRKLAI.KYHPDKNPDEGE KFKL ISQAYE VLSD PXKRDVYD 
QGGEQAIKEGGSGSPSFSSPMDIFDMFFGGGGRMARERRGKNW 
HQLS VTIjEDLYNGVTKKlJVliQKNVI CSKCEGVGGKKGSVE KCP L 
CXGR GMHXHI QQ IGFGMVQQI QTVCI ECKGGGERINPKDRCESC 
SGAKVIREKKI I EVHVBKGMKDGQKII*FIK5EGDQEPEI*EPGDVI 
iYLDQKIMISVFQRRGKDLIHKMKIQIiSEAIiCIGFKKrl KTLDNR I 
LVITS KAGBV I KHGDIjRCVRDEGM P I YKAPLE KGI LI IQFLVT F 
PEKHHIjSIiBKLPQtjEAIJiPPRQKVRITDI»IDQVELKBFCPKreQN 
WRQHREAYEEDEDGPQAGVQCgTA 


55B4 


3 


1265 


SSGCRQGRPGRSDRPRPPPRRHKMVKETRYYDILGVKPSASPEE 
IKKAYRKLALKYHPDKNPDEGEKFKLI sqayevlsdpkkrdvyd 
QGGE<^IKEGGSQSPSFSSPMD1FJWFFGGGGRMARERRGKNVV 
HQLSVT1iEDLYWGVTKK3CiAI*QKNVI CEKCEGVGGKKGSVRKCPL 
CKGRGMHIHIQQIGPGMVQQIQTVCIECKGQGERINPKDRCE5C 
SGAKVIREKKI I EVHVEKGMXDGQKILFHGEGDQEPELEPGD VI 
I^bQKDHSVFQRRGHDIiIMKMKIQI*S EALCGFRKTIKTIiDHRI 
LVI TS KAGEVI KEGDLRCVRD EGM P I YKAPLBKG I LI IQFLVI P 
PEKHWI^IiEKLPQLEAIjLPPRQKVRI^DDMDQVELKEFCplIEQN 
WRO^RmYEEDEBGPQAGVQCQTA 


5585 


2*1$ 


915 


IiPAGTPESSLHEALDQCMTAI^DLFLTNQFSEAIiSYLKPRTKESM 

YHSLTYATILBMOAMMTFDPQDIUAGNMMK^ 

SVTDSFSSLVNRPrLGQFTEEE IHABVCYAKCLLQRAALTFLQD 

tnBnrcDTtmnTtnrauQVfvi'wm ntiT xTr\{zf2nvf*KV**niivnmwf2fi 
KlWwof XKOoXtlVKJIa i\|f I JLn I-ilAo-ttV tf>oy i^MjSQI ntrtia aw 

VKLG VGAFNLTLSMLPTR I LR lA&WG PSGNKDYGLLQLEEGAS 

GHSFRSVLCVMW»LCYHTFLTFYEJG^ 

YPKGAIFLFI^RI EVIKGKIDAAIRRFEE<X^QQHWKX}PHHM 
CYWEJjMWCFTYICGQWKMSYFYJVDIiI^KE^ 
SMFGKEDHKPFGDDEVFX»FRAVPGtiKLKIRGKSL PT 
RRYFSSNPISLPVPALEMMYI WNGYAV igkqpkltdgile I ITX 
AEEKLEKGPBNBYSVDDBCLVKIjI^ 

ISANEK^IKYI^YLIPNALLBIJVLI^IJHEQDRNERAIKLLESAKQ 
NYK1WSMF^RTHFRI<^TLQ7^KS£LBNSSRSMVSSVSL 


5586 


2619 


915 


LPAGTPBSSLHFJUjDQX^flALDLFLTNQFSEATiS YLKPRTKESM 
YH3LTYATILEMQAMMTFDPQD I IjLAGWMMKEAQMLCQRHRRKS 
S VTDS FSSLVNRPTLGQFTEEE I HAEVCYAKCLLQRAAI/rFLQD 

VKLGVGAFNLTLSMLPTRXLRLLE FVG FSGNKD YGLLQLEEGAS 
GR^FRSVLCWflLLI/TYHTFLTFVI^^ 

YPKGAIFLFLAGR IEV1 KGNIDAAIRRFEECCBAQQHWKQFHHM 
CYWKIiMWCFTYKGQWKMSYFYADIiI^K^NCTSKATYIYMKAAYL 
SMFGKBDHKPFGDDEVELFRAV PGLKLKIAGKSLPTEKFAIRKS 
RRYF5SNPXSLPVPALEMMYIFWGYAVIGKQPKLTDGILEI ITK 
AEKf^KGPEl^SVDDRt^VKLLKGLCLKYT^RVQEAEENFRS 
T SANEKK I KYDHYL I PNTUXELALLIiMB^DRNEEAIKLIiESAKQ 
NYKNYSMESRTHFRIQAATLQAKSSI^NSSRSMVSSVSL 


5587 


176B 


148 


SSAVPlX^VGRP\^VAVGGPPHSCttCRPCX3iM^ 
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DwmI -4 r* b F»r? end 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G*=Glycine, 
H=Histidine, I»Isoleucine, K=Lysine, 
L=I*eucine, M=Methionine # N=Asparagine, 
P= Prol ine , Q-Glutamine , R=Arginine r 
S=Serine, TVThreonine, V«*Valine r 
W=«Tryptcphan, Y^Tyrosine, X-Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








CVAVYKDGRAGWANDAGDRVTPAWAYSENKB1VGLAAKQSRI 
RNISm^KVKQILGRSSSDPQAQKYIAESKCLVIBKNGKLRYE 
IDTGEETKFVKPEDVARLI PS KMKBTAHSVIX3SDANDWITVPP 
DPGBKQKNAIX3EAAilAAGENVljRLIHB?SAALIAYGIGQDS PTO 
K3NI IiVFKUGGTSIiSLSVMBVNSGI YRVLSTNTDDNIGGAHFTE 
TLAQ YLAS BFQRS FKHDWGNARAMMKLTNSAKVAKHSI>STLGS 
ANCFIiDSLYEGQDFDCNVSRARFBIjLCSPIjFNKCIEAIRGIjIj^ 
nopt Ann TtJKWT.rRryisii tp kt/V)I»i kdlfpavellns i p pdp 
\^pigaaieagii*igkenllvedslmiecsardilvkgvdesga 
srftvlfpsgtplparrqhti^apgsissvci,ex.ybsdgknsak 
ebtkfaqwi^dldkkengi^iiav^^ 
gkcbaisietas 


55B8 


3 


589 


TPPPPEQAMVAATVAAAWLIJ*WAAACAQQEQDFYD?KAVN1RGK 

LVSI^BMGSVSLVVNVASECGFTDQHYRAr^l^RDLGPHHFN 

VIAFPCNQFGCX5EPDSHKBIESPARRTYSVSFPMFSKIAW5TG 

AHPAPKYLAJC^SGKEPTWNPWKYLVAPIXSKVVGAWDPTVSVE 

RPQITALVRKLIIxLKRKDI* 


5589 


1884 


553 


I^QAWHEGGIGQTDKERGAAALPGEEGDPTRGRSLGRASWESGS 
PRRPRSP FS SFLPRPICLSIiEAKPtTS I &LJKKN Wb JU ItrKPLiAF/U> 
GLNRSSGLWUSPDRCRPRSRCSCSmiENPSPAAAl^KJVLCALLL 
ATLGAAGQP LGGES I CSARAPAKYS 1TFTGKHSQTAFPKQYPLF 

KEIPJ^EAI^VHA\^SAPAVPSGTGQT3AEI^VQRRHSLVS? 
WRI VPS PDWFVGVDSLDLCDGDRWREQAALDLYPYDAGTDSG7 
TTSSPNFATIPQDTVTEITSSSPSHX^ANSPYYPRLKAJjPPIARV 

■nT.T.OT uritJDDAT?Y OOZVDVT.PQl>T^TTOTVnQJV. €: lUPRTPT.nr r KV <: lTiW 
1 til ii< i iHy.n frvrVc x * Jr/>" vjJKOJtfJJJnaA. v uo/to vroi t j-iiw-c* v oure 

sswglogghcgrlgtksrtryvrvopanngspcpeleeeaecvp 

DNCV 


5530 


72 


896 


l£sSGAiiRIjLPAMVAWRSAFLVCT 

KmVK^TSSVKQPWDHriVlTi'NRPGTTRAPAKPPGSGLDLADA 

IiDDQDIXno^KPGlGGRERWNHVTTTTKRPV^ 

IJVDAIJ)DRNDRDIX3RRKPIAGGGGFSDKDI*EDIVGGGEYKPDKG 

mX^YGSNDDPGSGMVAEPGTIAGVASALAMALIGAVSSYISY 

QQKKFCFS I QQGLNADYVKGENIiEAWCEE PQVKYS'rljHTQSAE 

PPPPPBPARI 


5591 


68 


1494 


AGSSRRAAAERIIiVSAGCRSI^RASGVLLLPAEIjLPGEEEAMA 
DIGNKVSBQI^JAKMPMKKEAKPSAlO^ 

PVPVSEP^BPBPBPBPKPVKEEKLSPEPILWTASPSPMETSG 
C^AKKDLCX^SDVILAViroVDiUSIX^ 

RQIiEEEQAVRPKYIJCiGREVTGNMRAI L IDWLVQVQMKFRLt>QET 

MYMTVS I IDRFTXJNHCVPKKMLQLVGVXAMFIASKYEBMYPPEI 

GDFAFVTrNTYTKHQIRQ^MKIIjRAl^FGljGRPLPT,HFl^PAS 

KIGEVDVEQHTIAKYLMELTMIJ)YBMVHFPPSQIAAGAFCL^K 

IIJJNGBWTPTXiQ^tiSYTBESI^ 

TVKNKYATSKHAKISTLPC^SALVQDIAKAVAKV 


5592 


242 


924 


YGBS KDVWQKDLLSALVLTTVNCUjPTP I 

VGKSAIiVVRFLTKRFIHEYDPTLESrYRHQATIDDEVVSMEXLD 
TAGQBDT IQRBGHMRWGEGFVLVYD I TDRGS FBEVI>PLKNILDE 
I K3CP KNVTL I LVGNRADLDHS RQVS TEEGEKIiATELACAFYECS 
ACTGEGNITE I FYELCREVRRRRMVQGKTRRRS STTHVKQAIKK 
MLTKISS 


5593 


3 


1113 


HASGGR7\ANMAAERGAG(&(^QEMMEVDRRVESEra 

SSGIVADIiSEQ3lJaX3BBRGEBDPEEEHELPVDMETINLDRnAE 

DVDLNHYR IGKlEGFEVIjKKVKTIiCbRQIWiI KCIEin^EEIjQSIiR 

BkDLYI^QIKKIENLEALTELBILDISPTf^^ 

KLFLVNNKI SKlENI^CrtiHQI^LEI^NRIRAIENTDTLTNLE 

SLFI«GKNKITXI^kEALTNLT^ 

ELY LS HNG 1 EVIEGt*BNNNKLTWI«D IASNR I KKIEN I SHLTELQ 
BFWMNDNIiESWSDXiDEIJCGARSLETVYLERNPrjQKDP^ 
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HO; 


Predicted 
bscyirmincf 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid Begment containing signal peptide 
(A-Alanine , C~ Cysteine, D=*Aspartlc Acid, E- 
Glutamic Acid, F=Pnenylalanine r G=Glycine, 
H=Hi s tidine , 1=1 soleucine , K=Lys ine , 
L»Leucine, M-Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R*»Arginine, 
S-Scrine, T=Threonine, V=Valine, 
W= Tryptophan, Ye=Tyrosine, X^Unknown, *=Stop 
Cddon, /=posaible nucleotide deletion, 
V=possible nucleotide insertion) 








MIJUiPSvKQIDATFVRt 


5594 


3 


1113 


HASGGRAANMAAERGAGQQQSQKMMEVDRRVBSEBSGDEEGKKH 

SSGIVADLSEQSDKDGBERGEEDPBESHELPVDMBTINLDRDAE 

EfVDLIWiYRlGKIEGFEVLKKVKTliCLRC^ 

ELDLYDNQIKKI BNIiBALTEIiE IIiD I S FNLLRNI EGVDKLI^RIjK 

KLFL VNNKI S KX B3^NLHQIiQMI»ELGSNR IRAIKNIDTLTNLE 

SIjFIXSKNKITKXGJ^IiDALTNLTVLSM 

BLYI^HNGIEVIEGIJa^KLTia^ 

BFWMND^I^SNSDLDELKGA1?SLETVY^ 

MLALPSVRQIDATFVRF 


5595 


3 


1476 


ARWNGRWVQVPAWPGPGCGTNASOERQRQLPRAWRPVGRTLGSE 

PIAIAWSPPLYIJ'PI PLPSWAVSQPT PTLGTMPADLDYD I EEDK 

LGI PTVPGKVTLQKDAjQWZjI G I S IGGGAQ YCPC3UYIVQ VFDNTP 

AALIX5TVAAGDE I TGWGR3 IKGKTKVEVAKM IQEVKGEVT I HY 

NKIX)ADPKQGMSLDrVLKKVKHRLVENMSSGTADAIX^SRAIL^ 

NK5LVKRLEEI»ERTAELYKG^rEHTKNlALRAPYE 

VPS VI GVRBPQPAAS EAPVKFADAHRS IEKFGIRLiIiKT I KPMIiT 

DLNTYLNKAI PDTRLTI KKYLIJVKFEYT^ YCLKVKEMDDEEYSC 

IALGE P LYRVSTGNYE YRJh I LR CRQEARARFSQMR KDVLE KMSL 

IJXJKHVQDIVFOLQRLVST^KYYNIXmVIiRDADVFPI 

HTTLAYGI^EEPTDGREEEEEEDTAAGEPSRDTRGAAGPLDKG 

GSHCDS 


5596 


£98 


219 


GAVLAPSSLPAAEIAAQGES OSLEDLSNTSRPTSE VYKISFI FP 
NGDKYTX3IXn^TSSGlYERNGIGIHTTPNGIVYTGSWKDDKMNG 
FORLEHFSGaVYEGQFKDj^FHGIiGTYTFPNGAKY^ 
KGEGEYTaiQGTRMDVVTFHFT$f!SQT 


5597 


3 


731 


ISCKMAAIXMSSLPASWRSVTIiTHVEYPAGDLSGEailAYLSLSP 
VFVI VGP VTLI I FKRELHTISFI/X;iALNEGVNWL I KNVIQEPR 
POGGPHTAVGTKYGMPSSHSQFMWFPSVYS FLFkYLRtoHQTNNA 
RPLDLLWRIIVI^IXSLLAVAFLVSYSRVYLLYHTWSQVLYGGIAG 
GLMAI AWFI FTQEVLTPLFPRIAAWPVSEFFLIRDTSLIPNVLW 
FBYTVTRABAKNRQRKLGTKIiQ 


5598 




2440 


GIGPIAASFIFCKVASLYIFI^PPPPSVSGVPYSPANSSWSCAI, 
VPLI/3SGWPHPPAPS PCC&GQTMLKMIiS FKIiLIiIiAVALGFFEG 
nAJO?GERNEGSGARRRRCLNGNPPKRLKRRDRRMMSQLELLSGG 
EMLCGGFYPlU£Ca^RSDSPGIX3R^^ 

IKCALCSPHSQSLFHS PBREVTjBE^DLVLPLLCKDYCKEFFYTCR. 
GRIPGFI>0/rTADEFCFYYARKIX5GIjCFPDPPRKQWGPASNYIiD 
QMC^DKVEEISRKHKBNCFCXQBVVSGIiRQPVGAIjHSGDGSQR 
IiFIIiEKEGYVKl LTPEGEI FXBP YXDI HKLVQSGIKGGDERGLIj 
SLAFHPNYKKl^KLYVSYTTOQERmiGPKIJHlLRyVEYTVSRK 
NPIIQVDLRTARWLKVAELHHKHIX^LLrcPIX^FLYIILGDGM 
ITIIIDMBEMDGI^DFTGSVIJU^D^^ 

TNQPPEVFAHGLHDPGRCAVDRHPTDlNrNLTILCSDSNGKNRS 
SARILQI I KGKDYBSBPSLLBFKPFSNGPLVGGFVYRGCQSBRIj 

ygsyvfgdrngnfltlqqs pvtjcc+^ekplclgtsgscrgypsg 
hiwfgedblgkvyii^ssksmtqthngklykxvdpkrplmpee 
cratvqpaqtltsecsrlcrl^ycrptgkcccspgwegdfc^ 


5599 


326 


2440 

— _ 


GIGPIAASFIFCKVASLYIFI^PPPPSVSGVPYSPAlrfSSWSCAi 
VPLI/^GWPPHPPAPSPCCSGOTMIdMLSF^ 

EMLC53GFYPRLSCXriiRSDSPGIjGRLENKI PSVTNNTECGKLLEE 
I KCALGS PHS Q3 LFHS PHRH VLERDL VLPLIjCKDYCKEFFYTCR 
GHI PGFTiO^ADBFCFYYARKDGGLCFPDFPRKQVRGPASNYIJD 
G^EEYBKVEEISRianCHNCFCIQEVVSGLRQpVGALHSGDGSQR 
L F I LEKEGYVKXLTPEGKI FKEP YIiDIHKL VQSGI KGGDERGXL 
SLAFHPtfYKKNGKLYVS YTTNQERWAIGPHDHILRWEYTVSRK 
NPHQVDLUTARVFI^\^LHRKm^5GQLLPGPDGFLYI IIXIDGM 
I TLDDMBEMDGI^DPTGS VLRLDV^ 

TNQP PEVFAHGLI IDPGR CAVDRHPTDININLT I LCSDSNGKNRS 
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nucleotide 
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corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=»Aspartic Acid, E=» 
Glutamic Acid, Fo Phenyl alanine, G=Glycine, 
H=Histidine, I=*Isoleucine, K«Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P= Proline , Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y»Tyxosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poss±ble nucleotide insertion) 








SARILQI IKGKD x^EPSJjLEFKPPSNGPLVGGFVYRGCQS ERI» 

YGS YVFGDRNGNFLTLQQS pvtkqwqekplclgtsgs crgyfsg 
HII^FGEDBI^EVTILSSSKSMTQTHNGKLYKIVDPKRPLMPEE 
CRATVQ PAQTLTS ECSRbCRNG YCT PTGKCCCS PGWEGD FCRTG 


5600 


1977 


1244 


SLRVIiSGHLMQTRDLVQPDKPASPKFIVTLDGVPSPPGYMSDQE 

EDMCFHGMKPVKQTAASNKGLRGLLHPQQLHLLSRQLRDPNGSF 

SNAEMSET*SVAQKPEIOiLERCKYWPACKWGDBCAYHHP I S PCKA 

FPNCKFABKCLFVHPNCKYDAKCTKPIX^FTHVSRJR 

AVAPPAPPSSSQLOfYFPAOCKMECPFYHPKHCSlFiriXK^PI^ 

TFYHPTXNVPPRHAIiKWIRPQTSB 


5601 


1977 


1244 


SIiRVLSGffljMQTRDLVQPDKPASPKFIVTLDGVPSPPGYMSDQE 
BDMCFEGMKPVWCnTAASNKGI^GLl^PQaLHLLSRQLEDPNrGSF 
StUVEMSEr^AQKPEKLLEROKYWPACK^DE^YHHPISPCKA 

fpnckfaekclfvh™cxydakctkpdcpfthvsrri pvls pkp 
a\^ppappsssq1»c^yfpackkmecpfyhpkhcrfntqctrpdc 
tfyhptinvpprhaucwirpqtsb 


5602 


246 


766 


YHTS CTVWRXAKEALENTEVPVGCI^^ 

NATRHAEMVAIDQVLDWCRQSGKSPSEVFBH'rVbYVTVEPCIMC 
AAAIiRLMKI PLWYGCQNE R FGG CZGS VI*NIASADLPNTGRP FQC 
IPGYRAEEA^MLlCrFYKQENP^PKSKVRKKECQQIZ^NMF 


5603 


1 


565 


FRGRTPISGGRRGCAQYPI PATPARSGENRTMPGAGDGGKAPAR 
WIjGTGLIjGLFI^PVTI^I^SVGKATDXYAW 
CFGFKDI^FRWTnJSSnAFKII^EGTVKNEKSDPKVTLKDDDRI 
TLVG5TKEKRNNIS IWiRDLEFSiyiX5KYTCHVKNPKENNLQHHA 
TIFLQWDRRMQ 


5604 


1 


1506 


EDIFPAQLLKLQRHERVWQQEPPVRDHRSWGGSGAGGVAGREWT 
DQGQVALGGHYMAEGEGY FAMS EDRTiACS P YI P LGGDFGGGDFG 
GGDFGGGDFGGGD FGGGGS FGGHCLDYCESPTAHCNVLNWBQVQ 
RLDG I LS ET I P IHGRGNF PTLE LQPS I.JVKVVRRRIiAEKR I GVR 
DVRLNGSAASHVUIQDSGLGYKDLDIiI FCADLRGEGEFQTVKDV 
VI^CLI0)FTjPE<3VNK^KITPLTLKEAYVQKMVKVCWDSDRWSLI 
S IiSIWSGKNVEIiKFVDSLRRQFEFSVDS FOI KLDS LI^ 
NPMTBTFHPTI IGES VYGDFQEAFDHLCNKI I ATRNPEE IRGGG 
IJ^KYCOT^VRGFRPASDBIKTJ^RYMCSRFPIDFSDIGEQQRKIi 
ESYLQIffiFVGIjKDRKYEYIjMTIJ^ 

ITMLAIRVLADQNVI PNVANVTCYYQPAPYVADANFSNYYIAQV 
QPVFTCQQQTYSTWIiPCN 


5*6-05 ■" 


35 


1821 


SQRSCPRSPSSPAPPWARCSNPDSRTGGVPVPRAWSAGGPALGL 

MAAPVRLGRKRPLPACPNPLFVRV«^TEWRDKATRSRHRTRFVFQ 

KALRSIMYPLPUtSGKBAKILQHFG^lK3^^ 

GF-HAPDS PSGENSP APQGRT AEVQDS SMP VPAQPKAGGS GS YKP 

ARHSGAR VI IJjVLYREHLNPNGHHFLTKEELLQRCAQKS PRVAP 

GSARPWPALRSUJ UWLVrjRTltQPARYSLTPEGIiEliAQKLAESE 

GLSLLNVGIGPKBP PGESTAVPGAASAEXiASEAGVQQQPIiBLRP 

GEYRVIiLCVDIGETRGGGHRPEIJjRELQRIJIVTHTV^ 

VWAQETNPRDPANPGELVIjDHIVERKRLDI>LCSSITIX5RFRJBQ 

KFRL I<RCGLERRVYXVEEHGSVHNLS LPESTlIiQAVTNTQVTDG 

FFVXRTADIKESAAYLALLTRGLQRLYQGHTLRSRPWGTPGNPB 

SGAm'SPNPLCSIinTFSDFNAGAIKNKAQSVREVFARQLMOVRG 

VSGEKAAALVDR YSTPAS LLAAYDACATP KEQBTLLSTI KCGRL 

QRN3jGPALSRTl.SQLYCSYGPLT 


5606 


3 


1099 


GRS RC PGPGARGGTMS PRS CLRS LRLLVFAVFSAAASNWLYIiAK 
LSSVGS ISBEETCEKLKGLIQRQVOMCKRNLEVWDS VRRGAQLA 
IKECQYQFRNRRWNCSTLDSLPWGK^ArrCX^TREAAFVYAISSA 
GVAFAVTRACSSGELBKCGCDRTVHGVSPQGFQWSGCSDNIAYG 
VAFSQS FVDVRRRS KG AS SS RALMNLHNNEAGRKAI LTHMRVEC 
KCHGVSGSCEVKTCWRAVPPFRQVCHALKEKFDGATBVEPRRVG 
SS RALVPRNAQFKPHTDEBLVYLE PS PDFCEQDMRSGVLGTRGR 
TCMKTSKAIDGCEI*IjCCSRGFiriAQVEIiAERCS CKFHWCCFVKC 
RQCQRLVELHTCR 
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ID 
CTO: 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


{A^Alanine, (^Cysteine, D^Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G=Glycine. 
H-Histidine, I=Isoleucine, K=Lysine, 
I*=Iieucine, M=Methionine, N^Asparagine, 
P^Proline, Q-Glutarnine, R-Arginine, 
S=Serine, ^Threonine , V^Valine, 
»»Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /=possihle nucleotide deletion, 
\=possible nucleotide insertion) 


5607 


521 


141 


PPV(^PAKAMPSPGTVCSL1^LGMI.WI,DIAMAGSSPLSPEHQRV 
QQRKBSKKPPAKLQPRALAGWLRPKDGGQAEGAEDELBVRPNAP 
FDVGIKLS GVQ YQQHS QALGKFLQD I LW EEAKEAPADK 


5608 


1 2 


983 


Wl^QSPLRQADPGPPRHTLFMDFYAGAIGGVCGDAVGY PLDTVKV 
RIQTBPKYTGIWHCWDTYHRBRVWQFYRGI,LLPVCTVSLVSSE 
VFGTYRHCLAHICRLRFGNPDAKP TKAD I TLSG CASGLVRVFLT 
SPTBVAKVRLQTQTQAQKQQRRI*SASGPIiAVPPMCPVPPACT 
KYRGPLHCLAWAREEGLCGIiYKBSSAL^ 
VLCEWI^PAGHSRPDVPGVLVAGGCAGVIAWAVATPMDVIKSRL 
QADGQGQRRYRGbriHCW VTIVRKEG PRVLFltGLVRfCCRAFPVN 
MWPVAYBAVLRLARGLLT 


5609 


1628 


304 


AKGVWVLPSPPPRPGRGALVSGSGLRRGRSGTSWRPRRMNHKSK 
rat. -i\n/4TJ^Ait±'i^iuJi> lujwrj, KnX* I I act If o US PAAVADNVERAD 
ALQLSVEBFVKRYERPYKPVVLLNAQBGWSAQEKWTLERIjKRKY 
RNQKFKGGEDNDGYSVKMKMKYYIE YMESTRDDS PLYI FDSSYG 

ehpkrrkllrdykvpkpftddi.fqyagbkrrppyrwfvmgpprs 
gtgihidpi^tsawnalvqghkrmcl?ptstprelikvtrde?gg 
nqqdbaitwfwi yprl'qlptwppefkple i lqkpgbtvfvpgg 
wwhvvljjiidttij«tqnpa33thppvvwhktto 

ILKQEHPEIAVTiADSVDLQBSTGIASDSSSDSSSSSSSSSSnSD 
SBCKSGSEGDGTVHRRKiOlRTCSMVGNGDTTSODDCVSKRRSSS 


5610 


54 


1196 


LERTPASADMAWTKYQL PLAGIj^VTGS INTLSAKWADNFMAEG 

CXMSKB^PQHPFI^VGMFIjGBFSCLAAPYLLRCRAAGQSDSS 

VDPQQPFNPIJjP^pPALCDMTGTSIJmnLN^ 

VI I FTGIJPS VAFLGRRIiVLS QWLGI LATI AGLVWGLADIiLSKH 

LoynMikJii v j. x ASULtLi J. xraAU-L ± \fJ\X wWVJjEKKcVYKIiIfVHPIjRA 

VGTEGLFGFVILSLIjLVPMYYIPAGSFSGNPRGTLEDALDAFCQ 

VGQQPLI AVALTjGNISS IAFFNFAGI SVITCELSATTRMVLDSLR 

TVVI WAI^LATjGWEUUFHALQI 

LSRGRPLAEESEOERKLGGTRTPINDAS 


5611 


2 


$77 


FVLPNRX/3IPGSTFRGPGACASSSSLAASAKPGAGGS PAIAMSG 
RLSNP FOC3GKA POT «T .OTlPrWP DT Jin TlTOC1?T.r»nrvirv , eri»Ert, , rr m? 

KLTAFKEICraEFDIiNNEGEIDl>5SLKRMMEKlX^ 
ISEVTGGVSDTI SYRDFVWMM1^KRSAVI»KLV1WFEGKANESSP 
KPVGPPPERDIASLP 


5612 


1 


721 


ASRxXJYMDATIAPHRIPPEMPQYGfititlfll PET^QAMWLCKHLNS 
SLLTLKNL»I LNEFS YTATEARRLYLQRKTVPSAIJjVQLIQKRIA 
EEDCI KQGWIIjDGl PBTREQALR^QTI^I TPRHVI Vl^APxTn^L 
IE3?iniGKRIDPQTGEIiTnTFDWPPESEIQNRLMVPEDISBLBT 
AQKIiLE YHRNI VRVI PS YPKI LKVI SADQ PCVDVFYQALTYVQS 
NHRTNAPFTPRVLT.LGPVGS 


5613 


115 


1279 


RGVDPALRRAEKMbPLS I KDDKYKPPKFNLFGKISGWFRSIJ-iSD 

KTSRNT^FFFUXNl^FAFVreiJL,YGIWSNCI^LISDS FHMFFDST 

AILAGLAASVISKWRDNDAFSYGY^/RA^^ 

FI FSEGVERALAPPDVHHERLLIjVS ILGFWNTjIG I FVFKHGGH 

GRSHGSGHGHSR^IjFNGALDQAHGHVDHCHSHEVOKHGAAHSHDH 

AHGHGHFHSHIX3PSIiKETTGPSRQIIjQGVF^ 

ASAIFJ^ONFGI^IADPICSILIAILJ^SVIPMjRESVGIIjMQR 

tppllknslpck:yqrvqqlcx5vyslqeqhfwlcsdvyvgtlkl 
ivapo^arwii^qthniptqagvrqlyvqidfaam 


5614 


3 


1268 


li^r^hacplqagi^lixjrkpkairgregratnqgqgetqner 

apkgarqri^vmaer^qi^bfeiptgrbai^gnhsallrvadyc 

bdnyvqatdkrkaleewiafltxjaiiasvayqv^ 

lqgaalrqvbarvstlgqmv^^ 

qkvxapbblppltpycirrpi^fgclddi ghg i kdls tqlsrtgt 

i^rksikapatpasatiigrppripepvhlpvvpdgri>saassas 

s1asagsaegvggaptpkggaappapplpssldpppppaavevf 

qrpptl^elsppppdeelplpldlppppplogdbljgijpppppgf 

GPDEPSWVPAS YXBKVVTLYPYTSQKTJNEJjS FSEGTVI CVTRRY 
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to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C<= Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H^Histidine, l=Isoleucine, K= Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P= Proline?, Q= Glut amine, R=Arginine, 
S=*Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y*=Tyrosine, X= Unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
\ ^possible nucleotide insertion) 








SDGWCRGVSSEGTGFFPGNYVEPSC 


5615 


9 


1558 


ALGRRR PGD PREMEAAATPAAAGAARR E ELDMDVMRPL I NEQN F 

LLGLPLAlM^IVLGPISLVPIGlISVHC&mLVRCSHFLCLR 
FKKSTLG YS DTVS PAMEVS PWSCLQKQAAWGRSWDFFLVITQL 
GPCSVYIVPliAENVKQVHEGPLESKVPISNSTNSSNPCBRRSVD 
LRIYMLCFLPFIIIiLVFlRBLKNLFVI>SFLA^ 
YWRTWPDPI^PIVAGWKXYPLPFGTAVFAFEGIGWLPLENQ 

MJUS&iUCr F\2AIiWAv5lw lull 

I»PQDfVWr»YQSVKII»YSFGIPVTYSIQPYVPAKTIIPGITSKFHT 
KWKQ ICE FG I RS FLVS I TCAGAI LI P RLD I VI S FVGAVS SSTLA 
L I L ? PLVK I LTFSKEHYNIWMVLKN I S IAFTGWG FLLGTY ITV 
RE I rYPTPKWAGTPQS PFLNIiNSTCLTSGLK 


5616 


1 


719 


DDrVRCXSPQSAAMGASARLIiRAVIl^ 

LKHLSSGDliLRDNNl£Gra 

HEIiKMjTQYSWIjLDGFPRTXiPQAKALDa^^ 

IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQRRDD 

KP ETV I KRLKAYBDO/rKPVLEYYQKKGVLKTFSGTF/TNKI WPY V 

YAFLQTKVPQRSQKASVTP 


5617 


176 


765 


PWRGRGSRPRGAGAMAKEQVNRSAGLAPDCEA&ATABTTVSSVG 
TCBAAGKSPEPXDYDSTCVFCRIAGRQDPGTELIiHCENEDLICF 
KDIKPAATHHYbVVPKIOnGNCRTLRKDOVBL^ 
ERNNFTDFTNVRMGFHM PP PCS I SHLHLHVLAPVDQLGPLS KLV 
YRVNSYWFITADHLIBKLRT 


5618 
• 


3 


1692 

- 


YI^YINLKSEN)0>SGKEDLWEKLQYLWKSTLNLPEDLLRVPDES 
LFLNSGGDSLKSIRLLSEIEKLVGTSVPGLLBirLSSSILBIYN 
HILO/TWPDEDVTFRKSCATKRKI^ 

FTCHNEINAPVVLSRGSQILSLNSTRFLTKLGHCSSACPSDSVS 
OTKIQKI^I^SPVLIGKSKDPSOn^SEEGKPAIGTQKMELH 
VRWRSDTOKCVDASPLVVIPTFDKSSTTVYIGSHSHRMKAVDFY 
SGKVTOT3QILGPRIESSACVSKCGNFIVVGCYNGLVYVLKSWSG 
KKYWMFTTEDAVKSSAT5IDPTTGL1Y1GSHDQHAYALDIYRKKC 
iiuvc irrvyrir pc o iv*t wr t duut vpatt rvtiT t t.xw«Di*?YajWTW 

VW K2> MJLiljr A V V O C» FLJjJM JL» 1 FHHLt XtitlJ^^luJjAVJNt'ftXViB* V J.W 

KHSCGKPI^SSPQCCSQYICIGCVDGI^^ 
GPIFSSPCTSPSEQKIFFGSHDCFIYCCNMKGHLQWKFETTSRV 
YATPPAFflNYMGSNnBMtJCAAASrDGKV^ 
GEVFSSPVVI*ESMLIIGCPJDHYVYCXDLIiG<afQX 


5619 


2160 


1477 


DSPVLPTSGHVISTAQPAQPWSAVKAAIJ?SLGSPPGAGRGCPCP 

RSCPQPRPLEEIJJlAGSSTRPQPLTSSCCGMSCMYSFLGBCSVt. 
LWGfTKGRGSGSPSSPGCC^PPAC^SQDLPLVHVBVGWQPPLGP 
TVGLRPG1^F^QRGAIJ^DPQC^PI.PATVREDI*GVPSPWAA 
ECSPPATP 


5*20 


930 


1B2 


PLPPPTUVMFLTRSBTORGVNTFSPEGRLFOVEYAIEAIKLGST 
AIGIOTSEGVCLAVEKRITSPLMEPSSIEKIVEIDAHIGCAMSG 
LIADAKTLIDKARVETONHWFT^ 

EEIW)PGJU^RPBX5VAX^FGGVDE3CGPQLniMDPSGTFVQCDAR 
AlGSASEGAQSSLQEVYHKSMTliKEAIKSSLI IXJCQVMEEKLliA 
TNIEIATVQPGQNFHMFTKEELSEVIKDI 


5621 


3 


819 


VVEFVKYTATDlANVKN^I^ 

ElTOLTWVIiKHCERFIJC(XOTSIKSSIiCXQG3*Y^ 

MIMLGDKEKTTQFIJIQFSRLLTSAFI^LPPJmSSYLPNiyrVEJS 

GIHPVYFCSrHYlEMLUCABIJ>LVFSAFEMSGFAPSQICLQWIT 

QCFMNYUWIEIOlYIJlICVF^ 

in^TQDI^VFLKBBALHGFRVSDYPEYME I LRQN YRTVLLRDMR 
NIRLQST 


5622 


1122 


- 456 


AASTKDAVSRKRSHS ASEKSGTGTS ISKRLNMN PQ I R2J P M KAMY 
PGTPYTOFiaiLWBAiroRNKTS^ 

QVDSETHCHABRCFLSW FCDDILSPNTKYQVTWYTS WSPCPDCA 
GEVAEFLARHSNVNLTI FTARLYYFQYP CYQKGLRSLSQEGVAV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
correspond ing 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«*Alanine, C^Cysteine, D^Aspartic Acid, E— 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K=I*ysine, 
L=l»euclne, M=Methionine, N=Acparagine , 
P=Proline, Q=Glut amine, R^Arginine, 
S-Serine , T=*Threonine , V=Valine , 
W=Tryptophan, Y-Tyrosine, X-Unknown, *»Stop 
Co don, /=>po6sible nucleotide deletion, 
\«=»possxble nucleotide mseirtion) 








KIMDYEDFKYCWBNFVYMJNEPFKPWKGLKTHPRLLKHRI.RBSL 
Q 


5623 




954 


FLPFF IRAPKI SRNGQWLFTFTTP FPFANKALPGWEGIVPACFW 

RKKI LTPSTGTMHLLQVT I L FLLP S I CSSNS TGVLEAANNS I>W 

TTTKPSITTPNTESXjQKNVVTPTTGTTPKGTITNBIiLKMSLMST 

ATFLTSKDBGLKATTTDVRKNDSIISNVTV^ 

SKPKTETQS3 1 KTTE IPGS VLQPDAS PSKTGTLTSI PVTIPKNT 

SQSQVIGTEGGKNASTSATSRSYSSI ILPWIALIVITT.SVFVL 

VGLYRMCWKADPGTPENGNDQPQSDKBSVKIiTVKTISHESG^ 

SAQGKTKH 


5624 


159 


898 


PGVAAAAGALPQYHGPAPALVSCRRBLSLSAGSLQLERKRRDFT 
SSGSRKLYPT)THALVCLt>BDNGFATQQAEI I VSALVK3LEANMD 
IVYKDMVTKMQQEITFOXJVMSQIANVXKDMIILEKSBFSLA^ 
NEKIKLELHQLKQQVMDBVI KVRTDTKLDFNLEKSRVKBLYS LN 
EKKI J4ELRTB rVALH7\CK^RAIiTCnT>RKIETWAGLKTMIiK SI0C 
LDNIKYLAGS I FTCLTVALG FYRLW I 


5625 


1 


1IB0 


TI PS SAAAQRAGP P AGAI*EAI>SPGGARAHABRRGEMRAT PliAAP 
AGSI»SRKKRLKLDDNI^TERPVQKRARSGPQPRJ J PPCLI.PLSPP 
TAPDRATAVATASRUJPYVIJjEPEEGGRAYQALHCPTGTEYTCR 
VYPVQRAIAVIjEPYARLPPHKHVARPTKVLAGTQIiLYAFFTRTH 
GDMHS LVRSRIIR I PBPKAAVL FRQMATALAHCHQHGIiVLRDLKL 
OlFVPADRERKIO.VLENIiEDSCVl^PDDSLVraifflACPAYVGPE 
ILSSRASYSGKAADm^IiGVALFTMIAGHYPFQDSRPVLIiFGKI 
RRGAY M »PAG t»S APARCI1VRCI1I1RRB PAERLTATGIUaHPWI.RQ 
DPMPIJVPTRSIILWEAAQVVPDGLGIJJEAREEEGDREVVLYG 


5626 


3123 


2011 


PPRALGSVAMENQVLTPHVYWAQRHRKEYLRVELSDVQNPAI S I 
TENVIiHTKAOGHGAKGDNVYEFHLEFLDLVKPE PVYKLTQRQVN 
ITVQKKVSQWWEMKLTKQEKRPIjPIiAPDFDRWLDESDAEMEIiRAK 
EEERI^KLRl^EGSPETLTNbRKGYLFMYNT.VQFLGFSWIFVN 
LTVRFCII^KBSPYI)TFHTVADMMYFCQMIAWETINAAIGVTT 
SPVLPSLIQLLGRNFILPITFGTMEEMQNKAVVFFVPYIjWSAIE 
I PRYS FYMLTCI DMD WKVLTW1«RYTLW iPLYPtGCLABAVS VI Q 
SIPIFKETGRFSFTI^YPVKIKVRFSFPIjQIYLIMIFLGLYINF 
RHLYKQRRRRYGQKXKKIH 


5627 


3123 


2011 


PPRAI^SVAMEKQVliTPHVYWAQRHREiYiRVEI^DVQNPAlS I 
TKNVLHFKAQGHGAKGDN^YBFHLEFIiDL 

IWQKKVSQWWERIjTKQEKIiPLFXJU J DFDRWU)ESDAFJ^ETJ?AJC 
EBERI*NKIJiriESEGSPETLTNI*RKGYIjFMYNLVQFIX3FSWIFVN 
LTTOFCILGKRSFYDTFHTVADMMYP^ 

SPVI^SLIQI^GRNFILFIIFGTMEE^NKAVVFFVFYLWSAIB 
IFRYSFTOLTCIDMDWKVLTWUIYT^ 

SI P I FNETGRFS FTLP YP VKI KVRFS FFLQI YLIMI FLGLY INF 
RHLYKQRRRRYGQKKKKIH 


5626 


— t£ 

75 


1455 


VAGAMA^KCijKACjr SSGi»L>Ko Irl»w*bfc*Li£> I KVbAM 1 bob PLJtljJ? 

SLSPVARS FSACSVGLGRSSYRATS CLPALCLPAGGFATS YSGG 

GGWFGFX;iLTGNEKETMQSLNDPXAGYIiEKVRQI^ENASLESR 

IREWCEQQ VPYMCPDYQS YFRT I EFXQKKTLCS KAENAR1.WE I 

DNAKIAADDFRTKYBTEVSIJaQLVESBINGLRRI IZ>DI»TLCKS D 

LEAQVESLKEELLCLKXNHEEEWSLROai^ 

DI^VLEEiTO<X)YOTLVENHRRDAEDWIiDrrOSEBI^ 

QLQS CQAE 1 1 ELRRTVNALEI ELOAQHSMRIJALESTIiACTEAR Y 

SSQLAQtf QCM I TNVKAQIiAE IRADLBRG^QE YQVIiIJJVBARLE C > 

EINTYP^LLESmSKLPCNPCAPDYSPSKSCLPCIJPAASCGPSA 

ARTNCSARPICVPCPGGRF 


5629 


2287 


938 " 


GRPRS SSDKRNFLRERAGI>3 SAAVQTR IGNSAAS RRS PAAR PPV 

PAPPALPRGRPGTEGSTSLS7VPAVLWAVAVVVVWSAVANAMA 

NYIHVPPGSPEVPKtiNVTVQIX2BBHRCR^G^^ 

QBVTtt}LPTDGITNKLIGCYVGNTMEOV^ 

DEEVKSPT&VLQAHGCAPQLYCTFNI^ 

PAI FMIARQIJ^IHAIHAHMGWlPKSNIWLKMGKYPSL I PTGP 
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ID 
NO: 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residua of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A^Alanine, C=Cysteine, D-Aspartic Acid, B== 
Glutamic Acid, F~Phenyl alanine. G=Glycine, 
H=Histidine, l=icoleucine, K=Lysine, 
L=Leucine, M^Methionine, Kr=sAsparagine, 
P-Proline, Q=Glutamine , R=Arginine # 
S=Serine, T» Threonine , V= Valine, 
VMTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ADEDINKK PI^DIPSSQILQEEMTHMKBII^MLGSPVVLC3»IDL 
LCKNT I YNEKQGDVQFI D YEYSG YNYIAYDIGNHFNE FAGVS DV 
DYSLYPDRBLQSQWLRAYIiEftYKEFKGFGTBVTEKEVE ILFIQV 
NQFAIASOTFWGLWALIQAKYSTIEPDFWYAIVRFNQYFKMKP 
BVTAIiKVPK 


5630 


1194 


278 


G FWAI AQTCAHHLPPG5 PWLVPASPWRLPI^SS FGYRTJLTVALF 
TLICCPGSDEKYFEVHVRPKKIiAVEPKGSLBVNCSTTCNQPEVG 
GLETSLDKlLLDEQAQWKHYLVSNISHDTVLQCHFTCSGKQESM 
KSNVS VYQP PRQVILTLQ PTLVAVGKS FTI BCEVPTVBPLDSLT 
LFLFRGmTLHYBTFGKAAPAPQRATATFNSTADREDGHRifttC 
LAVLDLMSRGGNI FHKHSAPKMLE I YE PVSDSQM VI IVTWS VL 
IiSLFVTSVLLCTIFGQHI^QQl^TYGVRAAWRRriPQAFRP 




1053 


290 


SRVDDFVRPEPSRAEPSRSGRRRPARKAATMSVFGKLFGAGGGK 
AG KGGPTPQE A I QRLRDTEEMLS KKQE PLEKKI EQELTAAKKHG 
TKNKRAAIX^LKRXKRYEKQI>AQ IDGTI^TIBFQREAIjBN 
TBVUOWGYAAKAMIC&AHB^^ 

TA1 S KPVGFGEE FDEDELMARLEBLEQBELDKNIjIjEI SGPBTVP 
LPNVPS IALPSKPAKKKBBEDDEWKEXiENWAGSM 


5632 


3 


952 


WLGWS PPRRLWWGSLGAAQRPAVPVSGLARSLHVETRRPHRRA 
SVRVARGRLGVWAQPQPIiLPRPVGSRREMQPPGPPPAYAPl*NGD 
FTPVSSADABDIjSGSIASPDVKLNLGGDFI KESTATTFIjRQRGY 
GWLLEVEDDDPEDNKPLI^BLDIDLKDIYYiaRCVLMPMPSLGP 
NRQVWDHPDFWGPLAVVIJFSMISIiYGQFRWBWIITIWIFGS 
LTI ELIJ^VIX^EVAYGQVIiGVIGYSIjLPLIVIA 
WSTLIKLPGVFWAAYSAASLIiVGBEFKTKKPLLIYPI FLI»Yr Y 
FLSLYTGV 


5633 


771 


460 


QGCSKTMSVGRPBYRSSEFMEQLJUSSKI^QVPFFCCFTVVCLCN 
CLFENS VS KLYMLCFNFFMS I FFYSLS ITKUJLI YLWGIiS YQS L 
LLLLLSGHRPWGSSMV 


5634 


1446 


855 


PRATGR IRSRAAASRPRAGAGASGAE PRSGRERSRLSGRRAPAM 

ARNTLSSRFRRVDIDBFDBNKFVDEQBEAAAAAAEPGPDPSEVD 

GLLROGDMLRAHIAALRNSPVNTKNQAVKERA 

SSEiTCQAVQSIiDKNGVDIilMOTYro^ 

ALAVGGLGS I IRVLTARKTV 


5635 


3 


943 


DRGPRS TATDTGRAR VS b'WRFPLDPGVKNSNVQlSGEKRRFRTL 
RS LFHPFPVTRSGAPRAVLVGS S WPAKMVAPAVKVARGWSGLAIj 
GVRRAVLQLPGLTQVRWS RYSPEFKDPL IDXEYYRKPVEELTBK 
EKYVRBLKKTQLI KAAPAGKTSS VFEDPVrSKFTNMMMIGGNlCV 
IjARSIJ^ IO/TLEAVKRKQ FEKYHAAS AE BQATI ERNP YTI FHQAIi 
KNCEPM IGLVPlIiKGGRFYQVPVPIiPDRRRRFIiAMKWMITECRD 
KKHQRT^MPEra^HKLLEAFlIl^PVIKRKHDbHKMA 
HYRWW 


5636 


2253 


1143 


LKDTI CQH P PAEKKL Y I> YHR KLRB VKRNG I PRLPKD VFMDTHQG 

LTDVRAKVTGFSEGVVDS VKGGFSS FSQATHKAAGAWSKPRBI 

ASTjIRNKFCSADNI Pl^Kl^LEEGQVDIlAGKAIiGVI SNFQSS PK 

YGSEEDCSSATSGSVGANSTTGGIAVGASSSKTNTLDMQSSGFD 

ALLIffilQEIRETQARLEESFBTIiKFJIYQRDYSLIMQTLQ 

CERJ^BBQI^IiTBIiHQireil^KQEIiASMEEKlAYOSYERARDI 

QEAIiBAOTmiSKMELQOXMQQVVQLEGXE^ 

IJJWMAVLLVFVSTVANCVTO^ 

HWDALFSYVERFFSSPR 


5637 


948 


2532 


MSFLXSARANAKMMAAYNGGTSAAAAGHICniHHHHLPHLPPPHLH 
HHHHPQHHLHPGS AAAVHP VQQHTS SAAAAAAAAAAAAAMLNPG 
QQQPYFPS PAPGQAPGPAAAAPAQVQAAAAATVKAHHHQHSHHP 
O^QLDIEPDRPIGYGAFGVVWSVTDPRDGKRVALKKMPNVFQNL 
VSCKRVFRBtJCMLCPFKHDNVI^AI^ 

IJ40^DlJIKIIVSP^PLSSDHVKOTI,YQII^GI,mJISAGIIJmD 
I KPGKfIiVNSNCVLKICT3Ft3t^VKFm 

PE ILMG SRH YSNAI DI WS VGCI FABLLGRR I IjFQAQS P XQQhDh 
I TDLLGTPSI»EAMRTACEGAKAH I IjRGPHKQ PS LP VLYTLSS QA 
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SBQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
'corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C«Cysteine, D-Aepartic Acid, E=« 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H-Histidine, I=Isoleucine, K=Itysine, 
L-Leucine, M<=Me thionine , N=Asparagine , 
P=Proline» Q=Glutamine , R=Argiuine f 
S=Serine, T=Threonine, v^Valine, 
W=Tryptophan, Y^Tyroslne, X=Unk»ovm, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








THEAVHIiLCRMLVFDPYKRISAKDAl^PYI,DEGI^YHTCMCK 
CCFSTSTGRVTTGDFEPVTNPKFDDTFBKNLSSVRQVKEIIHQP 
II^QQKC^VPLCINP^SAAFKSFISSTVAQPSEMPPSPIiVWE 


5638 


125 


1155 


drkmski^lrqeaeoi.knqirdaria\camtlsqitnniupvg 
ri(2hrtrrtiirghlakiyamhwgtdshllvsasqd6kli iwdsy 
t™cvhaiplr£swvmtcayapsgnyvacx^ 
egnvrvsrblaghtgyi^ccrfijjdnqi^ 

GQfflTTTFTGHTGDVMSLSLAPDTR^ 

CRQTFTGHES DI NAICFFPNGNAFATGSDDATCRItFDLRADQEXi 
KTYSHDNIICXSITSVSFSKBGRLIjLAGYDDFNCNVWnAIiKADRA 
GVI»AGHDNRVSCU^rn)DGIlAVATGSWDSFLKIWN 


5639 


125. 


1155 


DRKMSBLIX1I^QEAEQI«KNQIRI)ARKA^ 
RIQMRTRRTLW33aAKIYAMHWGTDSRJ^ 
TTNKVHAIPLRSSWVMTCAYAPSGNYVACGGIJ5NI CSrm^KTR 
EGNVRVSRBIiAGHTGYI.SCCRFLDDNQIVTSSGm'TCALWDIBT 

CRQTFTGHESBINAlCFFPNGlB^ATGSDnA'rCRLFDLRADQEIi 
MTYSHDNI lOGITSVSFSKSGRLLIjAGYDDFtf C3¥^ 


5640 


2 BO 


1092 


QQGNKKTMLSHNTMMKQRKQQAT^ 
IPRDIM3^BmSHl£TOGWlLETaOTQR^ 

HHS IAMQtTOKVIX^NIiEGGSQQAPLTPPNTPDPRSPPNPbNIAP 
GYSGPLKEIPPEKFNTTAVPKYYQSPWEQAISNDPELLBALYPK 
LFKPEGKAELPDYRSFNRVATPFGGFEKASRMVKFKVPDFEIib 
LTDPRFMSFVlfPLSGRRSFNRTPKGWISBNIPIVTTTBPTDDTT 
VPBSKDIj 






332 


^HNCNGDVKLLSNQMDKLFAFHLFTFHGLIJiFIJX3SIQKLIQA , 
EIILSDHSSII.VLENNPLFKVKSKQFIHLIAKKPYISITIVSAS 
NGBSFVLSMIVTG 


5642 


199 


1247 


ITPCRMDFIiVLFLF YI»AS VLMGLVLI CVCSKTHS LKGLARGGAQ 

IFSCI IPECL^RAMIIGLIjHYIjFHTRKHT F I VLHLVIjQGWVYTE Y 

TWBVFGYCXJBLBLSLHYLLLPYI^GWLFFFTIjTCGTNPGI it 

KAKBIJjFIjHVYEFDBVMFPKNVR€STCD^RKP^ 

vhr fdhhcvwvnnc i gawn i r yfl i yvltltasaatvai vs ttf 

fcVHLVVMSDLYQF/TYIDDI/3HLH^^ 

LGFVVVLS YIXFVLYLAATN^ 

VAWPPSAEPQVHRNIHSHGIJlSNLOEIFI,PAFPCHBRKKQB 


5643 


1 


847 


PSGQVRD\^TRGPX33RAARGPRVVMERRGVGAGA1AKKKIjAEAK 

YKKRGTVIiAEIXJIiAQMSKQLDMFKIKIiBEFASKHK^ 

RVQFQI^CATIGVDPLiASGKGFWSKMrCTGDFYYBLGVQlIEVC 

IiALKHRNGGLITLBEI^O^VLKGRGKTAQDVl^DDLIRAIKKl^ 

ALGTGFGI IPVGGTYLIQSVPAKlJ^HTVVLQLAEKNGYVTVS 

BIKASLKWBTFJU\RQVIJBHliLKBGIiAWIJ)LQAP^ 

TDLYSQBITAEEARBALP 


5644 


B3 


1138 


PRRMGSVJVQLITSVGVQ^mPGWTVAGQFQBKKRFTEEVIEYFQ 
KKVSPVHLKILLTSDFAWKRFWVAELPREEADALYBALKNI.TP 
YVAIBDKDMGX)KEQQFRBWFIJCBFPCIRWKIQESIBRLRVIANB 
IEKVHRGC\TLANWSGSTCILSVIGVMIJ^PTAGLSI*SITAAGV 
GLGIASATAGIASS I VENTYTR SAELTAS RLTATSTDQLEALRD 
ILHDITPNVI^FA1^FT)EATKMIAN0VIITLRRSKATVGRPLIAW 
RYVP INWETLRTRGAPTR I VRKVARIJLGKATSG^VVLDVVNL 
VQDSLDLHOCGEKSKSAELIjRQViAQF^ 


5645 


537 


799 


VQSVRDJjKRLSPTDPPGDSGNRDVTREDPVTGPLNSASSQVPTL 
YLO^SIOiGHSSVKDARATMELYQISQRIRARRGLPRLAVSD 


5646 


3745 


| 3328 


AEOYGTS PHLLPThU%T^SCLPPANVTTKAATPPPLVLSLTTADP 
AGKPAPCRVTLTLLRAS ipatkrasflssfikmpfbeleyilgf 

LSIJjKFHVHVS VYSAI CHFYjKEGlt^ISRSFTCn^PBLFPRLQTHL 
RAEGGAQ 


5647 


286 


800 


GVTMATS ELSCEVSEENCERREAFWAEWKDLTIiSTRPEEGCSI*H 
EEinBRfi^TYHQQGG^QVLVQRSPWLMMRMGII^^ 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 1 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, C~ Cysteine. D-Aspartic Acid, E» 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N~Asparagine, 
PMProline, Q=Glutamine, R^Arginine, 
S=Serine, ^Threonine, V-Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








QRV^LPIFTPAKMGATKEERKiyrPIQLQELI^BTAXX^CVD 
RQEVAEITKQLPPWPVSKPGALRRSLSRSMSQEAQRG 


5648 


7 


1518 


VI^3I^GRUEAIi?EVGABWPPPTCSPNICSGLQQAGNTDWSIi'rM 
APQS LPSSRMAPLGMLLGIJjMAACFTF GLSHONLK e paltnpe k 

sstxeterketkaef^lqaevlevfhptheno^^ 

HVRIiNIX/rcEREAKIiQYKDKFPJIinjKG KRLD INTOTYTS QDLKS 
ALAKFKEGABMBSSKKDKAiKJAEVKRLFRPlEBLKKDFDKLNVV 
IBTDMQIMVRLINKFNSSSSSLEEKIAALFDLBYYVHCH^^ 
LLS FGGLQ WINGr^STEPLVKEYAAFVLGAAFSSNPKVQVBAI 
SGGAI^KLIjViriATBQPLTAJaaCVLFALCSLLRIJPPYAQRQFLK 
IX5GL0VLRTLVQBKGTE VI»AAm VVTLliYDI*VTEKM FABKEAELT 
QBMSPEKIXXiYRQVHLLPGLWBQGWCEITVHH^^ 
LQTLGVJuLTTCRDRyRQD PQI/?RTIaASL<2AEYQVLAS IJSLQDGE 
DEGYFQBUjGSVNSLLKEIiR 


5649 


1172 


3006 


KLQKQ1UDAINEEIRMIQBBKBSTELRAEEI BTRVTSGSMEALNL 

KQIJiXRGSIPTSLTDLSIiASASPPLSGRSTPKLTSRSAAQDIjDR 

MGVMTLPSDLRKHRRKLIiSPVSREENREDKATIKCETSPPSSPR 

TLRIjEKLGHPALSQEEGKSAIiEIXJGSNPSS SNSSQDSIiHKGAKR 

KGIKSSIGRLFGKKBKGRLIQLSRDGATGHVLLTDSEFSMQEPM 

WAKLGTQABKDRJlIiKKKHQI*t 1 EDARRKG^ 

ELWVGMPAWYVAACRANVKSGA1MSALSDTEIQRSIGISNAUIR 

IJa.RIAIQEMVSLTSPS^PTSRTSSGNVWVTHEBMETI»ETSTK 

TDSBEGSWAQTlAYGDMNHEWIGNB^PSlX5LPQYRSYFMBa>V 

DARMLDHLTKKDLRVHLHWVDS 

i^krree&qhbi kdvi>windq^ 
hgaiilaldenfdhntlal i lq i ptqntq/^rqvmere fnnllalg 
tdrkldj^ddkvfrrapswrkrfrpp^hhgrggmi>sasaet:lpa 
gfrvstlgtriqpppappkkimpeahshyt.yghmlsafrd 


5650 


1172 


3006 


MLQEQLDAINEEIRMIQEEKESTELRAEKrKTRVTSGSMRAIiNL 
KQLRKRGS I PTSLTDLSLASASPPLSGRSTPKLTSRSAAQDIiDR 
MGVWIXPSDLRKHRRKIjLSPVSRRBNREDK^ KCETS PPSS PR 
TLRLEKLGHPALS QEEGKSALEDQGSHPSSSNSSQDSLHKGAKR 
KGIK3SIGRI*FGKKEKGRLIQl^RIXmTGHVIiLTDSEFSMQEPM 
VPAia^TQABKPRRLKKiaiQLLEDARRKGMPFAQWDGPTVVSWL 
ELWVGMPAWYVAACRANVKSGAI MSALSDTEIQRB I GISNALHR 
IxKLRLAIQEMVSLTS PSAPPTSRTSSGNVWV T mEEMETLE'TSTK 
TDS EEGS tf AQTIAYCDMNHRW IGNEWI>PSIK?LPQYRS YFMBCLV 
EftRMLDHLTKKI>I^VHI*KMVDS 
LEKRREBSGJIBIKDVLVWTOn>3VV^ 
HGALIiALDENFDHNTLALlI^ 
TDRK1»DIX5DDKVFRRAPSWRKRFRPRBHHGRGGM 
GFRVSTIXH'IiQPPPAPPKiaCMPEAHSHYLYGHMLSAFRD 


5651 


646 


1869 


ARQGQRQPWG*EARAKGPASESPRV*EGSGWEGPASP*TPGSTL 
AWGBGAGI R* ASGLTAAGAASAAAA/PPPTRGGPAPAGCGRAPP 
WPAPLRVPTHGRAPAPRS RAAPRAPALSHGTAAAALSPAS PAGP 
ADP *LPGHSSQS PPRG *RW0RSRSAPAPAHPEHPAPAGSASASQ 
QTPGWPGSCC^O^0^VBPriGAPGAEDG\PVPPQRGFPIiGTIiGS 
P AGSWAGLAGYG * AGAPGTQATAPRAAGQT PVAAAPNCR V*GSA 
PALHRAPAAADPGSPLQAPPRAWAS PAAAGPGLSSSDYCGGLGA 
GWRAGISPBLLGAAGLSD1WARCPGPGPAE*GGQPGCRTIPASA 
(34PSPPVEGSJjGLSRKGEGDLPSQAR*GWHBCRRARHLVPLPRL 
LGPRGRTGRPSSPS 


5652 


735 


343 


HHKKYQH1HQKSFSCPBPACGKS FNFKKHLKEHMKLHSDTRDYI 
CEFCARSFRTSSNLVIHRRIHTGEKPLQCEIC^FTCRQKAS^ 
BQRKHAffTVAAI»RFPCRFCGKRFBKPDSVAARRS^ 


5653 


66 


1401 


RGRLQSRGRLTLGLVUjI*LD I LGARQHGQRVSHGWKGGFLTAPIi 
CFPQPCQPGTRRGRRRSLKEATBPQLAMAEEFVTLKDVGMDFTL 
GDWEQI^LEC^TFm^AiDNCQDLFLr^PPRPNLTSHPDGSBD 
LBPLAGGSPEATSPDVTBTKNSPLNEDFFEEGFSQEl/SRDVrQ 
GWLLELQFRRSLYRGHLVR* FARRSRKS SEV* YCHQRGKSHGMQ 
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Amino acid segment containing signal peptide 
(A^Alanine, Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I^lsoleucine, K=Lysine, 
L=Leucine, M=Methionine, M=Asparagine , 
P^ Proline, Q^Glut amine , -R=Arginine, 
S=Serine, T=Threonine, V -Valine, 
W=>Tryptophan, Y= Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ ^possible nucleotide insertion) 








ES* I KERTQS CVHR FHGRRFHG \ DNVS K KTLTPAKS KBYRGEF F 
SYSDHSQQDSVQEGEKPYQCSBCGKSFSGSYRLTQHWITHTREK 
PTVHQECEQGFDRKASHSGYPKTHTGYKFYVCNEYGTP FS QSTY 
LWirQKTHAGEKPCKSQDSDHPPSHDTQSGBHQKTHTDSKSYNCN 
EOGKAFTRI FHLTRHQKIHTRKRYECSKCQATFNLRKHLIQHQK 
THAANV 


5654 


3 


598 


TLPLFPGRRFRGHRRCGAVAAR KNSTGGNVS T NQRRDS VRMSAL 
NWKPFVYGGLASlTABCX?TFPIDI.TKTRPQIQGQTOn^ 
YRGMLHALVR IGREKGLKAI*YSG * VGLHAFLCHCSIjFHMG IDFR 
PRLHRSQVKSLRCV*KEQIA**/MFSI*LISTLISKYIYYAADVL 
EKLFYYIQVQTDKINKKICLFKN1 


5655 


2 


867 


RPPGlRAPRQIJIPAAGRRPDASARPRFRPTVLIiHDPFQLSFPPP 
PLSYPSVFPAVARVLPQRSGDYRAAGMPQLSGGGGGGGGDPEIjC 

atdemipfkdegdpq\rekifaeivnpeebgdladiksslvnes 
biipasngirevarqaqtsqepyhdkarf^pdl^khpdgglynkg 
psyssysgyimmpnmnndpymsngsbsppiprtsnkvpvvqpsh 
avhplt plitys dehfspgshpski psdvns kqgmsrhppapdi 

PTFYPLSPGGGGQITPPLGWQGQP 


5656 


228 


1066 


PRRVP PX»PE FASQPGAAFFHSGRLQRS IjTKDSAGCFSQCRSRAM 
LVLRSGIjTKALASRTIiAPQVCSSFATO 

PSNMNAFMFmBOWnmRTSYSELVGFWSVBFGGRTNKVFHIWK 

YEWFPHRAEVRKAIiANCKENQEQS 1 1 PNtkAR IDRQEIEITYLI P 

NSKI^KPPKEGVYELAVFQMKPGGPAI^iGIlA^^ 

TKWGVPI rrE YGELNRVHVT.WWNRS ADS R AAVRHKGI IEDPI SWG 

GVRESVNYL \VSQQNM 


5657 


105 


1052 


GQRLQSPRVQMPVQPPSKDTEEMEAEGDSAAEMNGEEEKSEEER 
SGSQTESEEESSEMDDFJDYERRRSECVSEMLDLEKQFSELKEKIj 
FRERIiSQLiRIiRIjKEVGAEFJ^EYTEPLGGLQP^LKIRIQVAGIY 

kgfcldvirnkyecelqgakqhlesekiiiiydtlqgelqer t qr 
leedrqsldlssewwddklhargssrswdslppskrkkaplvsg 
p y i vymlq eid i ledwtai kkaraavs pqkrks d \dldpavhs q 

GDPQS S WHCTQDSRLPPADRRTHRPIAVC^ARLLWCCWALPLHL 
ALVWTPPL 


5658 


2346 


3541 


TERRVYNPWPEPDPD\CI QEDPWNIJPNS I KTLVDNIQRYVEDGK 
NQLLIJUiLKCTIXrEIiQIiRRDAI FCX)ALVAAVCTFSEQ 
RYNNNGEYEE$SIU3ASPJCKLEQ\mATGVLLHCGSLI^ PATVKKE 
RTMLBD I WVTLS ELDNVTFS FKQIJ)ENYVANTNVFYHIEGSRQA 
LKVTFYIJ>SYHFSKLPSMiBGGASLR1jHTAI»FTKVI*B^ 
PGSCAAEDLQQD INAQSIjEKVQQYYPJCLRAPYLBRSNLPTDAS T 
TAVKIDQL IRP INAIJ>ELCRIiMKSFVHPKPGAAGS VGAGLI P IS 
SBLCYRIX5AC$MVMCGTCMQRSTI^VSLE#^ 
IMQATDIMRRQGPRVEIIAKNIjRVKD^ 
GDL 


5659 


2 


696 


WKRSGEVSPKGEI^WRGNSGRPKJIGPJVAEAENEDRTLGRLLP 
GNKRSQPRSPliRLI*APQLKAEAAADKGlAPVpppFSSGHSGPC\ 
EREGEGQEGRGRSRRGAH LELKPS PGLRAGAPTDRGRGGPAEVA 
AAGGRRMVQKESQATLBERESELSSNPAASAGASLEPPAAPAPG 
BDNPAGAGG\ AAVAGAAGGARRPXjCG VVEG FYGRP WVMEQRKEL 
FRRLQKWELNTYL 


5660 


229 


853 


PVTMWAFSEI^PLLINLIVSI^FVATVTLIPAFRGHFlAARir~ 
C-GQDLN KTS RQQ I P E SQGV I S GAVFLI II«FCFIPFPFIiMCFVXE 
QR KAFPHHB FVALI GALLAI CCM I FLGFADDVLNLRWRHKLLLP 
TAASLPIJ>nnfrmFGNTTIVVPKPFRPIIX3LHLDLGR*SYlICC 
PYGTYFPvEPFLVLHIIoLQVFLFCLCVFPDPFW 


5661 


2 


473 


I^YPSPCGGIPKLPGLPREAAAALGASFLAEAPLPVYVRGSGL 
AGmVTXX>PXAFLSICFVTE»VFIfl^ 

ADFDVTGPHAPrijAMW^HVELQCQLFPfn^SABlJMBLRWYRCQP 
S LAVHMH ERGMI»4DGEQKWQYRGRT 


5662 


2 " 


1318 


LRKEGRCRRGS^GWAAPAEGI/^RGMTX3\mCLLRSVRFCSSA 
PFPKHKPSAKIjSVREAIjGAQNASG 
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Amino acid segment containing signal peptide"^ 
<A*Alanine, C=Cyateine r Ih=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine„ 
H=Histidine, I=Isoleucine, K=Jbysine, 
L=Leucine, M^Methionine, W^Asparagine , 
P=Proline, Q^Glut amine, R=Arginine, 
S=Serine, T-Threonine, V» Valine, 
W-Tryptophan, Y=Tyrosine, X«Unknown, *=Stop 
Codon, Apossible nucleotides deletion, 
\ -possible nucleotide insertion) 








WraDGSSI^I^^ADSGLDSRELTFGSSVEVQGQLIKSPSKH 
QNVELKAEKIKVIGNCDAKDFPIKYKERHPLEYLRQYPHPRCRT 
NVLGS I LRIRS EATAAIHS FFKDSGFVHIHTPI ITSNDSEGAGB 
LFQLBPSGKIjKVPEENPFNVPAFLWSG^IjHLEVMSGA 
FGPTFRAENSQSRRHLABFYMIKAKISFVBSLQDLMQVIBBr.FK 
AT*TMriVLSKCPEDVEIiC!HKFIAPGQKDRL *HMLKNNFLI I S YTE 
AVBILK^QNFTFTPEWGADLRTBHEKYLVKHCGNI PVFVINY 
PLTIiKPFYMRDNBDGPQKLEGSVA*HSDGUlILLSIWIGQP 


5663 


119 


698 


PAD IGRSTAKTPGPPRS 1.KMDD PRYGMCPLXGAS GCPGAERSLL 
VQS YFEKGPLTFRDVA I E PS LKBWQCH^DSAQQGLYRKVMLSNYR 
Kn^VFIX5IALTKPDLlTCI^QGKBPWNXKRHEWAKPPVICSIIPP 
QDLWAEQDIKDSFQEAI LKKYGKYGHANFQLQKGC1KSVDRCKVH 
KEHDNKXWQCLI PKKKK 


5664 
566<H 


118 


572 


SLSMESNHKSGIXSLSGTQKEAAIiRALVQRTGYSIjVQENGQRKYG 
GPPPGWDAAPPERGCEIFIGKLPRDLPEDRLIPLCEKIGKIYEM 
RMMMDFNGNNRGYAFVTFSNKVEA^ 
CAS VDNCRLFVGGI PKTKK 




347 


702 


WQHLI IIjLHCERTSPAMITSEXPVLQDSTWBTTAHSDAGSEIiE 
ETEVKGKRKRGRPGRPPSTNKKPRKSPGEKSRIEAGIRGAGRGR 
ANGHPQQNGEGEPVTtiFEWKIiGKSAMQRC 


5666 


213 


540 


VSCLPTSCKMITLNNOI^PVPFNSSHPDEYKI AALVFYSCI PI I 

GbFVNITALWVPSCTTKKRTTVTIYMMNVAI, 

FYYAKDEWPFGKYFCQILGA 


5*67 


1 


695 


HPLPSASIX^PSVSLC^SLCVRSAIiFJVWPMLPKRPJlARVGSP 
SGDAASSrPPSTRFPGVAI YLVE PRMGRSRRAFI*TGLARSKG FR 
VLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPATjtjJD 
ISWLTESDGAGQPVFVBCRHRLEVAGPSKGPLS PAWMPAYACQR 
PTPLTHHNTGLSBALEIIJ^AAGFBGSEGRLLTFCRAASVLKAI> 
PSPVTTLSQLQ 


566B 


691 


894 


csflfcipdlflqplijgrkeeeavlvggewspsldgldpqadpq" 
vlvrtairc7aqaqtgidlsgctkw 


5669 


407 


1 


DSGAPEGI^PLMSTQEGI^MHAHPOAYTPFIYIjHARKRRGEIGD 

adsrfndryahksaqlyflyfvcwifqdvyyftikeknhfffpk 
argaptkysgspigsptttpptrppsfniihpapboji^mqiiqki. 

NSQ 


5670 


3 


373 


SSECLTMAWIPJ^PIXII^TVSVASYFJGAQPSSVSVSPGQ^AK 
ITCSGDVIAKKYARWFQQKPGQAPVLVIYKin-roPSGIPERFSG 
STSGTTVTLTISC^AQVEDBADYFCYSATraJELWVF 


5671 


280 


524 | 


KFPPKKTPPHIAJMKSAITItVTQFIJj^ 
FKLLKAXKVAKIWGIjRKNKTN^ 


5672 


2 


557 


FVPATPDPGVWLPPSRDPAMAKRSS J*Y XRIVEGKNLPAKDITGS 
SDPYCIVKVDNBPI IRTATVWKTLCPFNGEEYQVHLPPTFHAVA 
FYVMDBDAI^RDDVIGK^CLTRDTIASHPKGKFSLPSOT 
HPPSH^F^SPI^SWBPAC^KJPFIJ^PEAGATFCTPGLCSAACS 
QAWLLLPLP 


5673 


327 


696 


ITVADQ ISHWSAGRI KNRTRI PECIHSSAATTLAGPHTMEGESV " 

KLSSQJTLIQ^OTDEKNQRTITVNPAH^^ 

DVMIVAEDVEIEAHRWLAACSPYFCAMFTSDMS 


5674 


17 


984 


GGGSMKGESTSAVI^GWIXJAiAFQHI^DSDTEGFLLGBVKGE 
AKNSITDSQtTODVEVVYTIDIQKYIPCYQLFSFYNSSGEVNEQA 
LKKILSNVKKNVVGWYKFRRHSDQIMTFRBRI^ 
DLVFTjLLTPS I ITES CSTHRLEHSLYKPQKGI*FHR VPLWANLG 
MSEQI^YICTVSGSCMSTGFSP^VQTHSSKFFEEDGSIiXEVHKIN 
EMYASLQE KUCS I CKKVEDSEQAVDKLVKDVNR LKR E I EKRRG A 
QIQAAREKNIQKDPQEbTtFLCQAI^FFPNSEFIJISCVMSLKID 
MFLKVAVTTTTISM 


5675 


80 


753 


EGSRRG PTRLARLSARAGRIiIl FP PGFS SRIjIHFRGVSECRRP PG 

ksgvpvsapgsk3kwwbkrpgmfslmas ccgwfkrwrepvrkvt 
llmvgi^nagktrataxglyge ypedvaftvg fs ki n lrqgkfe v 
tifdlgggirirgiwknyyabsygvifvvdssdf^rmeetkea^ 
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Amino acid segment containing signal peptide 
(A= Alanine, C— Cysteine , D=* As part ic Acid, B= 
Glutamic Acid, F=Phenylalanine, G-Glycine , 
H-Hiatidine, Islsoleucine, K=Lysine, 
L=Leucine, M-Methionine , N=Asparagine, 
P=Proline, Q^Glut amine, R«Arginine, 
S=Serine, T=Threonine, V=Valine, 
W»Tryptophan, Y~ Tyro sine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=rpossiJt>le nucleotide insertion) 








SKMLRHPRISGKPILVI.ANKQDKEGALGEADVIECLSLEKLVNK 
HKCI, 


5676 


2 


938 


FVSSPPPRPVQPARPGGFGLSGRRSIJLCQVA3TPAHVGVMRSPV 
RDlJUWDGEBSTimTPI.I.PGAPRAEAAPVCCSARyNIAIUVFFG 
PFIVYAIiRVNIiSVALVDMVDSinTMn)NRTSXACPBHSAPIKVH 
HWOTGKKyQWDABTQGHILGSFFYGYI ITQI PGGYVAS KIGGKM 
LLGFGIIiGTAVLTLFTP 3JUVDLGVGPL.IVLRALEGLGEGVTFPA 

WTYVFYFFGTIGIFWFXLWIWLVSDTPQKHKRISHYEKEYILSS 
L 


5677 


1 


1028 


PPRDGFIjBLKiU>SVPI^GPCPLTSI>SR0GER3GGHljVAAARAA 
VTABTHPL PhLAPLAVCQS VKS PAACQVRPRPRAVALPAALGG P 

pnPT TUT T'nUTMCCPDDOBr CT/ffT npy CMPf^Pl wvpr cy T LJT TTTTf 

\jKL> ii FVibTAA i Moo IroISoALiKiUUjoBbbfibUU^ VU 1 Jbii iifflii-iiH 
RKHAGPIVSVWHRELRKAKSNRKLTFLYLANDVI0NS1CRKGPEF 
TREFESVLVDAFSIIVARRADEGCKKPIiERLIjNIWQERSVYGGEF 
I(^LKI^MEDS KSFPPKATEEKKSLKRTFQQI QBEBDDDYPGS Y - 
SPQDPSAGPl^TEELIKALQDLENAASGDATVRQRIASLPQEVQ 
U Vi>t»ijBKX 1 D KaAA KH I, ifriv, rVJJKAUJUKMmi PUTS 


567B 


3 


593 


SSSPP33TPSLPLPFYLIiIiGQLRLQIJ^G'XAHI>SGAGKAAp(^>G 
GSGRTAAPRTPJVDPAAQSLMIMNKMKNFKRRFSLSVPRTKTIBE 

OT * u p»l»P^PUrtT UMDDKTCUT AT /nnt pyi npfvp/IQtffpo y>wnp/ ■» n p 

S LA& ITli^IrWUijHNRIWi^i^J^i'l^KU FFy Kv-Sa 1 J?o Jr i. LiMjUiS 
PGQLSPGVQFQRRONQPJIFSWEVRASGALPRQVAGCTHKGVHRR 
AAALQPDFDVSKRLSLPMDI 


5679 


2 


623 


LNSRVDDFVAVPGAIMDBDYYGSAAEWGDEADGGQQEDDSGEX3E 
DDAEVQQECLHKFSTRDYIMEPS I FNTLKRYFQAGGS PKNVTQL 
LSEOTTAVAQTVNIiAEWLIffKWE 

FDPRKADS I PTEEGETPAWLEGMI AHTTWRDIiKY KLABAH PDCL 

mt nvnnniTD^n' wt.tsd xnrfwiMTrvTToiT t t/r^wr 
MJuNlTl VltV^KVJ^LiKKJK.vr r^VirVlliljVv^IfJLj 


5680 


258 


592 


RRLTSTSEKLQNRNSHTPLESLIHPQPSYKGFGIMFGKKKKKIE 

32K3PSWFEHRVHTGFDPQEQIO?TGLPC^WH^ 

DPSCITPIQIiAPMKTIVRGNKPC 


5681 


45 


869 


LI^UCTIiGSmTKESQAEGYNRSGINNHQABDPRFCPSFCWM 
Dri r Pi>T>rii>TJdiri7K&T>Di>7i>r3cr*Dr!ATfSitfnY?virr^i7Uiyn?r t>t .tf i/tt. 

Kyi Kx^KlJKIViS/iilKx't' X Mr\ji>K^tr\D\Ji. l3FiLrtj.I\l\.L^» v WiMJtrijirij VJC J-li 

FTSAGLHTVYF IAVEDDKI LPLKSAERKPGVKHAPYTSIAGDDP 
PAS<^^QVMHW&AFLALWA 

LCLAS FGMTLLGNFGiTNDEEIHNVGTSLTPGFGTLTCWIG^AIi 
TLKVN I KNEGRRVGI PRVT LSAS I TLCVGPLLHPHGPKHPHVCS 
QGPVGPGHVL 


5682 


39 


622 


PSRS CLGTMRKWRHREVNLP SVTQQDAVCP AP I PSPGLSAQTGL 
QKIHGTI HO}VCPGAPAWPGSPWffiE>K5I^LI>VPI,LLLPGS YGL 
PFYHGFYYSNSANIX5NIjGNGHGKJ3IiI^GVKLVVBTPF^ 
GASVILPCRYRYEPALVS PRRVRVKWWXLSEKGAPEKDVI,VAIG 
LRHRSFGDYQGRVHIjRQD 


5683 


89 


778 


GSCGATAIiITRCLAWS VL I S RIiAMATYTCITCRVAFRDADMQRA 

HYKTDWHRYNTiRR KVASMAP VTAKG FQERVRAQRAVAEEES KGS 

AlYCTVCSKKFASFl^YEimDKSRRHVEI^KKAV0>VNRKVBI4M 

NEKNLEKGDGVDSVD!03AMNAAJC£A1K^ 

EARI^VVAVGTGGRGTHDRDPSBKPPRLQW^^ 

SEDEEHDLC 


5684 


195 


677 


'fWCFRGYIiGPRVIMKALDEPPYLTVGTDVSAKYRGAFCEAKIKT 
AKRLVKVKVTFRHDSSTVE VQDDHI KGPLKVGAI VEVKNLDGAY 
Q3AVINKLTDASWYTWFDDGDEKTLRRS3U:LKGERHFAESET 
LDQIiPLTNPEHFGTPVIGKKTNRGRRYE 


5685 


779 


1262 


LLI^XJFVVHCFlfLFPPFRFSHHMIPGPPGPHTro IPHPAIVTPQ 
VKQEIIPHTI^I^HVKPQHEQkKEQ^^ 
EMRAKVVAECrn*KE&AAINQILG!^ 
RQLHMQLYPGWSARDNYVSPSSIPVALHS 


5686 


128 


1181 


CTWWQVNITLLD INDNHPTW KDAP Y Y I MLVEOTPPDSDVTTVVA 

VDPDIjGENGTIiVYS iqpp^fyslnsttgkzrtthamldrenpd 

PHEAEIJTOKrWSVTIXlGRPPLKATSSATVFVN^^ 
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Amino acid segment containing signal peptide 
"{A=Alanine, C=Cysteine, IfeAspartic Acid, E=> 
Glutamic Acid, F»Phenyl alanine, G^Glycine, 
H=Histidine, I=Isoleucine, K> Lysine, 
L» Leucine, M=Methionine, W^Asparagiue , 
P=Proline, Q=Glut amine, R=*Arginine, 
S~Serine, T=Threonine, VaValine, 
W=Tryptophan, Y=Tyrosine, XoUnknown , *=Stop 
Co don, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QNLPFVAEVLEGIPAGVSlYQWAIDLDjEGLNGLVSYRMPVGMP 
RMDFLINSSSGWAJTTTELDRBRIAEYQIjRVVASDAGTPTKSST 
STLTIHVI^VNDKTPTFFPAVyKVSV3E0VPR\GSGW3G*AARN 
NDVGI^BL3YFIT(K5NVI)GKPSVGYRDAVWTVVGIJ)RETTAA 
YMLIL^IDNGPVGKRHTGl!ATVFVTVIJ)VraKRPlII^SSYV 


S687 


17 


917 


AAPPAPPDG/PPP/PPPAPPT/PGPAA/APASSCOPRLSAGRAA 
QGDGGAAAVGHVLWPAVGPVRVNPGLQTPVPRPBLLPGP\SSS 
LKSDSSYPPDAGLSDDKEPPDASLPPDPPPLTVP/ADA/PMPVT 
SGCRMPSTSASE/AAGCQGACTHAKGSBTPPPASPQTSBPAPSP 
LP PHLT«3PGMYS3EAKLPNSFSCLG1AGTGAG1* GTASAHGTG 
PPVLPHVCTPSLANPgP\AVGPBASSLPLGVSGIGMSA/SAPIS 
SSPPVAIGSCWLRGIPPPGSGFLCPGRAPGPVPITTHGQEGQGP 
VLDI 


5688 


1 


420 


LTKWDLPGKCYRLLKTG tKHGAMPEQVGVYWYS /CLYDSRKLFF 
*SHMIIRSLL*KVIDDSI^QLPLLRELLL**LNVIDRCIILAYV 
I^VEKTFAITYLKNFTVKVDFSLLGEIPLISMAAILKLWIMKID 
DGYIPAVF 


5689 


1504 


3 


IWA/RK?IGTWTATKPTHRAG*GGAEEYQPPFaPCJBGPRSTSRG 
GKG*GHAVGPGREIGKEGSLPFLGPKAX^F*SASCQRAFKGGAH 
GSTAR KPAPATPGTRHPRTMRTRBVAQGWPAGPRSQFWDQHPHS 
PGEHRPSG\S P LPACPPRAWPKAGAVASATGTG\ PQLPGSRGKQ 
KLFORTREPPLLQAGWAVRKPPWSEAKEGLGOAGRPSGMDS SAS\ 
PQTPGGRGSLKWGLPLYLGPHHDVK*RSDRLG* PP*GGQGGGGH 
GAPSTPGPGGEAW*LPQQTSRPKPGPQAY*GE\GSPGLQCPCSK 
EL*RVPPGSLGPSTQCKyEPTDKHS \GGADAQLEVSTAGSRSTF 
GQELKGPLnAGRLWPGAPSASSSHR*GG*ERARAGAGHRGST*A 
SSKIEQGRPRPGPTSDALADVRGGASS/GPHPWPLPGTLPNR/P 
GSPPPA*ASAGRKGTVSTLGGGLL 


5690 


1424 


58 


ps ppagvc71apap lpllalarrdrrp cs pgaeaapwqtggpaid 
gawrtsvsalrrgatg/apcspoaeaapwqtqgpaidgXdgelp 
*vrsbkaprgcgaegggpgsgpvrrpgagrgahagqgrqqdpep 
tx5lrhrqligaasharhri^rlrpghhqnrhvrrdpqappggpap 
ghaaalpertrgvae p pawahagsdawragr * sort * erar prh 
pttopjvgs\gopgyqppnphpgpssppaap\gprga*gnpolk 
kaprsdrnpsqglrtrirrpbtpdogppspagssasastfrcts 

SI£LIiGP/PGAHNt^APQDR*RGP*GDKRGAPGVAGEDPRPP* 
GNFVR* LLLMP/GVA* RHGTS PFLGPSLGENGGQWDSGNLFGTP 
KG* SHPAFTKST* SMEAEKS YWNHPHR\DRGRQGVRINCLRVGE 
SEMWGPYSAPRPGTVFI»S3FLSPASEEH\PEGSSSFNTPFPPAG 
PEGDPGLNS PGLLP 


5691 


107 


550 


ISNDPSPGYNIEQMAKRGKKLVELPyTVKGMDVSFSGrLSPIKD 
VAHRMLATGECTPEDr^rFSLQVMQ * KTGTESWG*RFYI VEQ^J* S 
GDAPLI FSP YLSLTGNCGFAMLVEITERAMAH\ CGSPGGPSLWG 
GVGVYVLLESVPLSYS 


5692 


1193 ; 


543 


TQAHTRAEKDRKaSVRALRimBRGPPT*RGSHPL\QSVPCIQK 
PSIFSSYPI /GLPQSGGEPGPVGEQQPVRRPEQPSCGPASHPIPL 
TSRSVPPGRGALPPDSLSTRKGLPRPSTAGHKVRESGIUCVPVSQ 
RLNLPVMGATRSNLQP PR KVAV PGPTR * RDQDS KQDFSS KPLQS 
VPGliASTQQT LTP AD S G PGTGGRD ATRAGL PG V lilWGNGVD 


5693 


1258 


1330 


ALTWP VRKGTTWWAQ PHGCSN L VSRARLDLSS RPSQNTE ?QAP 
*QAGPPSSLRPP\SRRR*APEWPKRATaSRCRGLSAPPWPWPAA 
RGB/PGSAPSHAP/PNS?RPSGTRHP/PGPSSRVLYSPSIiPRNS 
PEAIVWRSSRFPLWFPLRCCFWVSGFKDPNPVLRFF 


5694 


3 


1338 


GS KK PAKSLHRRGSGHKS S AGKWGSVTLSTAGALG * KQLHQ * WT 
QRCL\NNLSSEEFNASSSLNSLPSTPTASRRHSrrVLRTDSEKR 
SIjy^GIiSHFSESEEKAPKKLEYDSGSLEMBPGTSKWRRERPBS 
CDDS S KGGELKKPIS LGHPGSLKKGKTPPVAVE'S PITHTAQSAL 
KVAGKPEGKATDKGKI*AVKNTGIiQR5S SDAGRDRLS DAKKPPSG 
IARPSTSGSFGYKKPPPATGTATV^Kr^GGSATLSKIQKSSGIPV 
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location 
corresponding 
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nucleotide 
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Amino acxd segment containing signal peptide 
(A=*Alanine, C=*Cysteine, D=Aspartic Acid, E- 
Glutaraic Acid, F-Phenyl alanine, G=Glycxne, 
Hr=Histidine, I^Isoleucine, K^Lysine, 
I^Leucine, M^Methionine, N^Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, X-Unknown, *=*Stop 
Codon, /-possible nucleotide deletion, 
\«posaible nucleotide insertion) 








kpvngrktsijjvshsaepgfiapgarsniqyrsLprpaksssms ' 

VTGGRGGPRP VSSSI DPSLLSTKQGGLTPSRLKEPTKVASGRTT 
PAPVNQTDREKEKAJCAKAVALDSI^n!SIiKSIGSPBSTPKNOASH 
PTATlCLAELPPTPIJUiTAKSPVia?PSIJu>aiDKVNSNSLDLPSSS 
DTTQCI 


5695 


3 


1338 


GSKBPARSLHRRGSGHKSSAGKWGS VTLSTAGADG* JOQLHQ* WT 
QRCZ,\NNLSSEEFNASSSLNSLPSTPTASRRNSTIVLRTDSKKR 
SLASSGI^SWFSESHEKAPKKLKYDSGSLKMEPGTSKVfRRBRPES 
CDDSSKGGELKKP I S LGHPGSLKKGKTP PVAVTSP I THTAQSAL 
KVAGKPEGKATDKGKLAVKNTGI^RSSSDAGRDRI^nAKKPPSG 
lARPSTSGSFGYKKPPPATGTA'rVMQTGGSATLSKIQKSSGIPV 
KPVNGRKTSLDVSKSAEPGFLAPGARSNIQYRSLPRPAKSSSMS 
VTGGRGGPRPV3SSIDP3Lt»STKQGGLTPSRLKEPTKVASGRTT 
PAPVNQTDRBKEKAKAKAVALDSDNISLKSIGSPBSTPKNQASH 
PTATKLABLPPTPLRATAKSFVKPPSLANI>DKVNSNSLDLPSSS 
DTTQCI 


5696 


3 


1338 


UD^brAKDiitUC^aUHJU>oAbJVnUbvi JL»TAuAItG* KQIiHQ* WT 
GRCL VNNI^SSESFNASSSIiNSLPSTPTASRRNSTIVLRTDSEKR 
SLABSG 1>S WFS ES EBKAP KBGLBYDSGS LKMB PGTSKWRRERPES 
ODDSSK^ELKKPISLGHPGSLKKGKTPPVAVTSPITHTAQSAL 
KVAGKPEGKATDKGKLAVKimSLQRSSSDAGRDR^ 
IARPSTSGS FGYKKPPPATGTATVMQTGGSATLSKIQKSSGIPV 

kpvngrkts lovsnsaepgflapgarsniqyrslprpaksssms 
vtggrggprpvsssidpsllstkqggltpsrlkeptkvasgrtt 
pap vnqtdre kekakakavaldsdni s lks i gs pestp knqash 

PTATK7«AKTiPPTPT.13 a*FAVfJK*Vtf DDCT.RMT.'nirtrxicNicT rvr t»oc»ci 
DTTQCI 


5697 


1147 


47 


PSEAL3 PPACPSAPAPRRS I I SRLFGTS PATBAAP PPPBP VPAA 
QGPATVQSVBDFVPDDRI^RSFLEDTTPARDKKKVGAKAAQQDS 
DSDGEALGGNPMVAGFQDDVDLEDQPRGSPPLPAGPVPSQDITL 
SSBEEAEVAAPTKGPAPAPQQCSBPETKWSSIPASKPRRGTAPT 
RTAAPPWPGGVS VRHS P R BTH Q ^JTff P P AITMRDT 1 W CO & C c c i? c rv n 

EGPIAAOMLSFVMDDPDFESBGSDTQRRADDFPVRDDPSDVTDE 
DEGPAEPPPPPKliPLPAFRLKWDSDLPGLGLEEAGPKESSEEGK: 
EGKTPS KBNnKKJOCKK3KEEEEKAAKKJC3KHKKS KDKEEGKEERR 
RRQQRPPRSRBRTAA 


5698 


2 


666 


GAEAAEPQEDI»PPIiSQSSRFFQEQQKMNBCSUJPVS FKDVAVDFT" 

QBEKQQIiD PBQKI TYRDVMLENYSNLVS VGYH I IKPDVXSKLBQ 

GBEPWIVEGEFIu^SYPDEVWOTDDLIERIQEKENKPaROTVFI 

ETLI*R/ERGNVPGOTF15VETNPVpSRJOJm3ISI^SCKR\GF 

MASSEYISSIXJRYARMKADECSGCGKSIJjHIKLBKTHPGDQAYE 

FNQ 


5699 


2 


1448 


rvrqppglwvrrtvpamocpaglsrvpgyag/dpslpsfrgprd 

EAAHRGTI QT7ARHTR KLYVQGPASG P PLPR VSTQVA I *DEKPLA 
RPS/GRTNAPFPQGQKPAGKAAPGPAAAGRVAMR\PGHPGLLAS 
DSOSSSSKG9GWBTPVPWS *AQPGWVSGLLLLGDPSGPGSL*RS 
TWLWGGARGPEGSGVRa^GWPSGCSDIGWAIJ\GWNHS *HLDPNT 
WTQKWTGE/SPAPGEEG\VAPAPRGPTAEHGHCBL*ri'ESQY3NN 
VPII^QNPEGAliRSRRTEPAGMVPPTRI£E+DIX5*TAAPASGGAP 
VSTPl'WAQTP/LWASLGPTDPOGKPGCRPPCALPKPAGPERSA* 
GGSLGCR/ SMIiPASSGPPPAPGPRRLAAGAHTSASARCPPAAAA 
GWQPRRPGFAGRAAIiPGPPHP PSS *REIiGGLPGPGW* TLDPLPA 
HPAHPPGSAPPWGALGGWAAARASLPWS PSU3.SFPAVTPVAGL 
FPPGRG 


5700 


<m 


597 


l^HJO^EINIY*RRSNIffi<NSKSBSHIiNQDHSFPPPTPNSARS 
KlJISTGTAIQTOliPIiSGAPRQRAVFSGRTICQKFSSCLQCAYLD 
E*CSIASSLIKAILRVSVLSB 


5701 


59 


410 


ifekicsdtqefispkinpqicswlifdkgak/nhatGkdslfn 
kwswknwlstcr*mrpgpyftpytkinsk* ik/ danircetvkl 
leentgem^dtgix^fldmtpktqptkqk 



361 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
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Amino acid segment containing signal peptide 
(A^Alanine, CoCyateine, D=Asparzic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I*=Isoleucine, K=Lysine, 
^Leucine, M=Methionine, N=Asparagine , 
P*=Proline, QrrGlutamine, R=Arginine, 
S=Serine, T=Threonine, V-Valine, 
W^Tryptophan, Y=Tyrosine, X -Unknown , *-Stop 
Codon, /=possible nucleotide deletion, 
\ -possible nucleotide insertion) 


5702 


.3 


1517 

■ 


ETPVDPSQCGGIPSDSPHPVITPSRASBSSASSDGPHPVITPSR 
ASESSASSDGPHPVITPSRASESSASSDGLHPVITPSRASESSA 
SSDGPHPVITPSRASESSASSDGPHPVITPSRASESSASSDGIiH 
PVITPSIUISESSASSDGPIIPVITPSWSPGSDVTLLAEALVTVTN 
I EVINCSI TBI BTTTSS I PGASDTDLI PTEGVKASSTSDPPALP 
DSTBAKPHITEWASABTLSTAGTTESAAPHATVGTPLPTNSAT 
EREVTAPGATTLSGALVTVSRNPLEBTSAL3 VETPS YVKVSGAA 
PVS I EAGSAVG KTTS FAGSSASS YSPSEAALKNFTPS ETTI/rMD I 
TTKGPFP^SRDPLPSVPPTTTNSSRGTNSTIiAKITTSAKT^ 
PTATPTTARTRPTT\A*VQVKMKVSSSCG*VWLPRKTSLTPEWQ 
KG* CSSSTGNSTPTRLTSRSPYCVSGEANG/ PSAAARHVPYAKR 
GCCP* PGPPPTIXSCVTVrJiGTQKVPMKGSMSKPIjTPDVATGPS 
LTSTGVYVWGGASPVPRGVLGLTLAHVLCF3 KEKT 


S703 


14 


| 1117 


HHKDSHSQGLPRTQECARPELRPLIjCPRALHPVTRLSYRCPWQA 
PKAGIGTKAKPSESHLKmPGWPSLDROGEPATLGTGTGHCSDS 
RILRWHP*HTAAR*PRWRRI,PSSHRWTRiILGVr>RVQDKS* *VSL 
DPSCRPRPLRTC* * yGMRSVASSSNPPPGWSGPGASVFPARPVS 
ALPTGPKCW*APRGRTRQPCGWPRLSSPHATADHGPGCPI>SPSR 
GS WETAPGS * WCPl^*AARWTGWRTASGASAGLGRAADRPSAWA 
RRVAGI J.PGQGLTVRR*H* TAGAPASVRSS QGATRSPAPGGDQ C 
AOGRGPGSC*HPPPWPVSPSSPVPCPSGR*HLRGPIiLSAARPRA 
AGWPRHSPHDTQTPBP 


5704 


23 


562 


GDYEFDSPY WDD1SQAAKDLVTRLMEVEQDQRITAEKAI SHBWI 
SGNAASDKNI KDGVCAQIBKNPARAKWKKAVRVTTLMKRLRAPK 
QSSTAAAQSASAaixrATPGAAGGATAAAASGATSAPEGDAARAA 
KSDNVAPRRP * I*PPQPQME VPPQPLMAVSPQPPMEASLQPLMGK 
SPQP 


5705 


23 


*62 


GDYEFDSPYWDDISQAAKDLVTRLMEVEQDQRITAEEAI SHEWI 
SGNAASDKKI 1GX5VCAQI EKNFARAKWKKAVRVTTI^KRLRAPE 
QS S TAAAQSAS ATDTATPGAAGGATAAAAS GATS APEGDAARAA 
KSDNVAPRR P * LP PQPQMBVP PQ P LMAVS PQ P PMEASLQ PLMGE 
SPQP 


5706 


1161 


610 


QIjGRFXAQDTVAI RKVKEVPGTG AMRHW1 I*FTHKED*GGQALD 

DYVANTDNC^IJaDLVRECERRYCAFltN^GSVBEQROXX^LI^V 

IBRLGRKREGSFHSKDLFLDTUJLLQRTGAGAC^EDYRQYQAKVE 

WQVBlffiKQBIJlBKnSSNWAYKAijIiR^ 

LFFIIFIxF 


5707 


28 


609 


GSPAPTPGFRRRPGRGTPSPGTRHHQGRABPEPDAPERAPJbRR* 
MFAIQPGLABGGQFLGDPPPGLCQPELQPDSNSNPKASAKDANE 
NWHGMPGRVEPILRRSSSESPSDNQAEQAPGSPEEGVRSPPEGA 
BIPGAEPBKMGGAGTVCSPLEDNGYASSSLSIDSRSSSPBPACG 
TPRGPGPPDPLLPSVAQA 


5708 


44 


1925 


SFSWEETISPCFPKMPAEPWHIiSPVSLGAAGHPGQPRPYLDLPA 
QASVSRPHBRA*GEAVSLSLSSGDVCGHTDGGGAGSDPQAKPKP 
PRCPFTAMPSPRTKQKVRHKVCLLIAIRYSDIPSDVSKAP\GPA 
GNPH7)RSS1AA+I^RRAGAGSLCLSASLLPPSFSLGAPGAPSPL 
RVS PASGGPRKEGRQGS GG * AGGGGP \ARTHADLPCVG FVCS PP 

JjLK*SDS pvkqlpa\sgqgsgasmppvgs sdilrprptsvsgtg 

RAAG*CSWQPAACCTPRSQ*WAVARSPSRCSRW*RQSGR*RG*S 
S RRRRGP* AAGRSTPAVP * PCS * GGAGRRAYACRTGWGYAPS R * 
LEPSGPTSGSAL* TWASHSTGA* *SRIiCGTAGTGPLCSQSSRS * 
AG * RCCCTAAS PCGGSGPSHPGS PSAHCLSWSGGRTQPRAPSAH 
G3GRAMGSRCVCTCn?GLPCPGI PI^GASPGGSGBTGAGRGHTLK 
AARSRLS PRPGSGSRGSY*SHNDNWGTWPAPPSAGHLIjVGG*NS 
QRTSSDH*YTGTRRPWAGPGTRCSTAPSRAAPPVSRCRPPPPPP 

pprpprlpaaas/sggasgspaascscscrapakpass/gbapa 
ppprpeppppparrp 


5709 


2 


2031 


ITI^PLPO^imCXNWrEAATPI^IYrJCARNmAGGl^KEIaEIS^ 
IJIQIVVRWGAVVMRAGMGGCRCWGVMAPFAPR/NALS flvndcs 
Ij I HNOTCMAAVFVDRAGEWKIjGGLE YM YS AQGWGGGPPRKG I PE 
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residue Of 
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nucleotide 
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amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, O Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M-Methionir.e, N^Asparagine , 
P^Proline, Q^Glutamine, R=Arginine, 
S=Serine, T-Thr eonine , Valine, 
W tryptophan, Y»Tyrosine, X=Unknown, +»Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 








LEQYDPPKLAUSSGRVVREKRSADMWKLGCLIWKVFNGPLPRAA 
ALRNPG KI P KTLVPHYCELVGAN PK VRPN P AR FLQNCRAPGGFM 
SNRFVETNLFLEEIQI KEPAEKQKFFQELS KSLDAFPEDFCRHK 
VLPQLLTAFE FGNAGAVVLTPLPTWGKFLSAEEYQQKI I PVWK 
MPSSTDRAMR IRLLQOMEQPIQYLDEPTVNTQI FPHWHGFIiDT 
OTAIREQTVKSMbLIAPKLNBANI^ELMKH^ 
RCNTTVCI^KIGSyiiSASTRHRViTSAFSRATRDPFAPSRVAGV 
IiGFAATHNI»YSMNDCAQKIIiPVI»OT 

FLS KLESVS EDPTQLEE VE KDVHAASS PGMGGAAASWAGWAVTG 
VSSLTO KL» I RSHPTTAPTETNI P QRPTPFJGVPAPAPTPVPATPT 
TSGHWETQEEDKDTAEDSSTADRWDDEDWGSLEQEAESVLAQQD 
DWSTGGQVSRASQVSNTPTTNPPNPQSPTGAAGKXRGLIiGTGLA 
GAKLPGATS * R YTAGQRV 


5710 


1 


562 


IPGSTISC^ELMARMAKTIDSFTQNQTRLVVI IDGLDACEQDK 
VLQMLDTVRVLFSKGPFI AI FASDPHI IIKAINQNLNSVPSGFK 
\LNGHDYMRNIVHLWFXjNSRGIi/RQ/IX3KNFS *LQQQMBTFHA 
QILOOTRKFJ>TEEFHRTAI/;R*QNLVARQPS IDG*DAIGFET.YV 
C I A I Q FNTNKDDAT 


5711 


1526 


1130 


RRHPFQWTTVTQPJVFSHHDVAFT8TPVIiFYPDSAQPFIVKSE33 
SQIAKAVLSQQRPSI*FHECAFKOPFS*SLQRHTI^ 
I^EBROJIbFF>SS/I>m:PHNbK*/FEIHEHIiGSHEGIIWTLFFIJ, 
Qlli 


5712 


3 


1391 


grklfqsldiserlkfi,lti*dcvddtl i vlaerhgcu)! i kelp 
btvi dtjt^cltfhpskrptpdelmkdkvfsevsplytpftkpa 
slfssslrc^ltlpfj^isqix:ki>inndylafjrsieevyylwcl 
aggdi^keiivnkeiirskppictlpnflfbdgesfgqgrdrss/ 
tfr* yhwdiwmpakk* iercwgrs iiip itlkmtslilpysnsn 
nelsaaatlpl i irekdte yqlnr i ilfdrllkaypvkknqrwk 
earvdi pplmrgltwaallgvegaiiiakyt3aidkdtpiptdrqi 

EVDI PRCHQYDBLLSSPF^HAXFRRVIJCAWVVSHPDI.VYWQGLD 
SLCAPPLYrJNFNNEALVYACMSAFIPKYbYNFFLKDNSHVIQEY 
LTVFSQMIAFHDPEIjSNHIjWEIGFI PDL YAI PWFLTMFTHVFPL 
HKIFHLW\DTIJLLGEFLFPII»YWE 


5713 


<534 


284 


PVCAVPVDRWPVLPREDQEGQQb* AKLPRDFRR* FQ I LGPMEGH 
TACRCS RRG AQVQHLPRBDI RAAE * DPHLREVW PGLPTSSATS P 
* RAVLTS PCSHLGSADAASSHHItCGVSFH 


5714 


212 


613 


WGLGIXSPTMSSLGGGSQIIAGGSSSSSTOGSGGSGSSGPKAGAAD 
KSAWAAAAPASVADDTP P PERRNKSGI I SEPLNKSLRRSRPLS 
HYSSFGSSGGSGGGSMM^ESADKATAAAAAASLLANGUDLlAAA 
MA 


5715 


131 


1979 


ESASG^KRS«a,II.rLKLEI£GSAPKKTSARP^SLWI,pPHSQE 

OTPPASKl^GGGGLOTX^IiHP VPVTAAS PLPRWCLFGAVAX\ 

GIiPGP*I»CP3GAA/ GGIjQRGPGLSPLGAAGKVSCLHPPSMVENN 

DSTCHEHHEGILAARVTPVP\SGKPGRVIJ^ 

PRPPGS/SDLDGPRPQMHXRAPPAAHGGPVNTPHGGEPJCTFMSS 

QIRRXETKPL*PJCTPAG\NNYQSNSIPVSQSP^ 

TOAPSGPjGDAGKPTPGHG\LPXASVILTPNCPCSTiAGGQ* PPGL 

YPKTPKQRRWRRPL/LLGPSQ*GERQSTC+EV\GALGEPVR1PG 

L* PDLSCI LSNGSKHRREGLSFPRSLGPGRRGPAGbQSLGCSPT 

PKNTAC3ISSGHVALQAGHDSARDVGSGHVALO^^ 

VWR Wl PI*E * LGLSRETGQATRRGLVW I S PGRAAAACVACAQALE 

EGPLRLPGQDRG31QPCSHCPGPAAGQPBPGAGAPCRE/GG*DPT 

GliT/GVPGTDPKRGGRKPGX^GQETQGPTVHSGPESPLQPKP+E 

RQE/VGAGASSGVGLSRGRAGGPSSAWKVAAMIJJULRHGSHSEJj 

TDIiTEAQTSQH 


5716 


1711 


1370 


RVFSLLCEGPGHCYO^AVCREACAAASPGLDSAAEPHRLCEHTD 
*LPK*GPGYIQHFHCDSNILCILYKISFNLFSYSF*GVARYAC* 
RCPLVL*SGFFTI I VGGYSCCMPLKT 


5717 


44 


1489 


LPTEAIJIESEWVSEYGKCGPRGLVPE^STSPLPSSVI)TBD6IJ> 
EGPGALVLESDIiLGQDI^BBEEEEEEGlX^DO^^FERDSE 
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to first 
amino acid 
residue of 
t amino acid 
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Predicted end 
nucleotide 
location 
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amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=*Aspartic Acid, B=» 
Glutamic Acid, F^Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K**I»ysine, 
L-Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glutaraine, R^Arginine , 
S=Serine, T=Threonine, V=»Valine, 
W=Tryptophan, Y=Tyrosine, X=tfnknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possil>le nucleotide insertion) 








GDSIfiARPGItPYGLSDDESGGGRALSABSKVEEPARGPGSARGE 

RPG PACQLCGGPTG EGPCOGAGGPGGGPLLP PRIMS CRLCTFV 

SireSSHLKRHMCmiSGBKPPRCGRCPYASAQIiVNLTRHTRTHTG 

KKPYRCPHCPPACSSLGNI^RHQRTHAGPPTPPOPTOGFRCCTP 

RPARPPSPTEQEGAVPRRPEDALLLPDLSLHVPPGGASFLPDCG 

Q\ CGVKGRASAGLDQNHCQS /S LFPWTCRGCGQKLE KGKGSRIX3 

AAMCGRCWRGEAGGGASGGPQGPSDKGFACSI^PFATHYPNHLA 

KHMKTHSGEKPFRCARCPYASAHLDNIjKRH^ 

CPYACGNLAl^KRKGRIHSGODKPFllC^ 


5718 


120 


284 


VAHAI^LPAESYGbn)VSMTHPULPPTQLAWDLCRTCLPI>SYNFT 
S**SrADPLHI, 


5719 


48 


428 


ELNNGPFQNPLCNGGNIjAVTGSWADRSPLHRAJ^ 

LSQGYNVNAVTLDHVTPLHEACMDHVACARTI^EAGANVNAIT 

IDGVTPLFNACSQGSPSCABLLLEYGAKAQP\ESCLPSP 


5720 


1 


1051 


LQAFRNASEVPMVLVGTQDAISAA\NPRVYRRTSRARKLSTDLK 
\RCT\YYE\TCGGTYGLQMHSVSFQDVAQKWAL\RXKQQ\IiAI 
GPCK\SLPH\SPSH\SAVSAASIPARAPIHQGHE/SGGG$AFSD 
Y\SSSVPSTPSISQREURIETIAASSTPTPIRXQSKRRSNIFTS 
RKGADP\DREKKAAGCKVDSIGSGRAIPIKQGILriKRSGKSLNK 
EWKKKY\mfCI^rxLTYHPSI^Y^fQ^^IHGKEI DJjLRTTVKVPG 
KRLPRATPATAPGTSPRANGr^SVKRSKTQLGGGTGAPHSASSAS 
LHSERPLSSSAWAGPRPEGIJIQP^C^SSADQWSEATTSLPPGM 
QHPASG 


5721 


97 


492 


RHSSPCCSIJIRTERSSKAAVST/TTVQQFKRFIENYI^IGCVA 
VPYAI AGGL FLERAYY YAFAAH1 ITG I TDTTRVG I I LSRGTAASI 
SFMFSYIl^TMCRNLITFIiRETPLNRYVPFuAAVDFHRLIASTA 


5722 


88 


1043 


VALDVLAGS SPGGGMAGAIiljGPRVHGI RAVLRVARGGVQAPGAP 
GSI jGVSHAAAPPARPQGAAQS PHRGRRHGGGGAGLP PPR S PRFP 
QES VPASTSTARG PRRVSRRliPPQHPG PRGRRRR PGAGVGAPRR 
GRARGQAGLLGRQGQGGRGAE^ERAAIiQARRGRRPGPEPSQSCX? 
GRPRRAAAAPGRAPADPQPPAPRPAPAPDVRPPADAPAPAPAPA 
PPPPPHLGAIiTAGSGEERQ3QPRAKTLRI>GRGAPLP\PRAERGG 

RPKQAEQQQN pkrptppargpqssgdpamlpqraglrtgglagt 

KS5TREIPEMI 


5723 


88 


1043 


VALDVLAGSS PGGGMAGAIJjGPRVHfi t RAVLRVARGG VQAPGAP 

gslgvshaaapparpqgaaqsphrgrrhggggaglppprsprfp 
qeswaststargprrvsrri^pqhpgprgrrrrpgagvgaprr 
gpju^qagixgrqgqggrgaereraaix)arrgrrpgpepdqscx5 
grprraaaapgrapadpqppaprpapapdvrppadapapapapa 

PPPPPHLGAI»TAGSGEERQSOPRABTLRIjGRGAPLP\ PRABRGG 
RPKQAEQQQ\PKRPTPPARGPQSSGDPAMLPQRAGI.RTGGLAGT 
KSSTREIPEMI 


5724 


3 


1841 


FTNEAPPAPLPDASASPl^PHRRAKSLDRRSTBPSVTPDLLKFK 
KGWLTKQYIHX^WKiaWFAIJ^ 

I^ACYuVTBYPVQPJmJFQIHTKF^BFTI^A>TrSGIRRNWlQTI 
MKHVHPTTAPDVTSSLPKEKNKilSC^FBTCPRPTEKQEAELGEP 
DPEQKRSRARE\RRREGRSKTFT>WAEFRPIQOALAQERVGGVGP 
Aiym\DPWRPEAfiHGEI^^RARRREKPJiKRFGMl^TDGPGTR 
DAAIiRMEVDRSPGLPMSDT.KTHNVHVETEQRWHQVEl-rPLREEK 
QVPIAPWLSSEJX^DRI^THRLrSIiLEKEIiEO^KEASDLL^ 
NRLU3IX)I»RVAI^EQSAP^GYVLQATCBP^PAAME^THQKKI E 
DLQRqHQREIJSKLREEKDRLLAEElT^Tl SAIEAMKNAIIREF^ 
RBbBKSQRSQISSVM$DVFJUiRRQYl£BX.QSVQRELEVI»SEQYS 
QKCLENAHLAQALEAERQALRQCQRENOE LNAHNQELNNRLAAE 
I TRLRTIJjTGDGGGEATGSPIjAQGKIJtAYEIiEVPSGAR PCLTQLC 
TQEPQGSAAWPLSYRVVGGTDLRQQESOGPGRSKSPEGGEBQ 


5725 


3 


1049 


VKGHSEETSQSPNRTEP:-IDSDCSVDLGISKST3DLSPQKSGPVG 
SVVKSHSITNMElGGLKIYDII^I^\DI^SHX^PrJC/FTSAVDG 
KMIVRSKAATLLYDQPLQVFTGSSSSSDIiISGTKAIFXFDSNHN 
PE / GAKYNKRPHlCRAHNIaHLKYMYLHS 1 1 SNTVAV\ RSQRHFVA 



364 



WO 01/53312 



PCT/US00/34263 



SKQ 
ID 
HO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue o£ 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C= Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H^Histidine, I=Ieoleucine, K=Lysine, 
L-Leucine, M-^iethionine, N-Asparagine, 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threoniae, V=Valine, 
W ^Tryptophan, Y^Tyxosine, X=UnJmown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LQTKS PNRP0QF5SSAPS / VDQRAQ/ INQS YAKHSANMNFSNHN 
fJVRANTAYHLEQRLGPARHGEMWAISPNDRLI PAVTRSTI QRQS 
SVSSTASVNIjGDPGSTRRAQIPEGDYLSYREFHSAGRTPPMMPG 
SQRPLSARTYSIDGPNASRPQSARPSINB1PERTMSVSDFNYSR 
TSP 


5726 


i 2 


| 486 


SRSbSMWKNSGLPASSHSS KLPVTVGFSGCVKRLRLHGRPIjGAP 
TRMAGVTPCILGPLHAGLFFPGSGGVrTL/BSVGAGlPGPSRAG 
j O^PGGSGBGPPLSSPSQPI*PADLPGATLPDVGLBLEVRPrAVT 
GLIPHl^QARTPPYIXJLQVTBKQVLLRADDG 


5727 


21 


221 


RP I L I LKBTRELP WATG YAE VIN AGKS THNE DQASCEVTjTVKKK 
AGAVTSTPNRWSSKRRSSIi?NGE 


5728 


2 


877 


gtrngqfeprrgrawbgsagglrapgaaaggpgvqprgsg/lpg 

NAIRAGVNPGRGPASPFWDIiSLPWDLWPPPTDHAPGAPDFPAVE 
GR\PWAGGRPPWPV5GVLGSRVCX3PLYSTSPAGPG/SGGLSPSQ 

ggpagaggdag/lpgrcpsapwragsrpaascpdwipgpqglwii 
hrmpts/gppsqigegaeqgdegvaixapqiqckn/gaedppaed 

BPPQVPRAOBEDAVPABEGPG^TPBTOAJ}0^mERPBAia»AEGGa 
KGSPRRIiADPQDIjPAGOMS LAPP FP PVAAVIRSNX 


5729 


1 


1525 


AGGARKVLTl^IK^AGPVGAHKWNQQDAALGRATDSkEPPdEL 
QJJJAAIAWQGKLTTHKEEIVYPKNPYW 

KS IPNGKI^SPI^^TTPKPIJPTEASIRVWSDFI^VHIiHPRSI 
CM IQKYNHIX3BAGRLBAFGQGBSV]jKEPKYQBBIjBDR1jHFYVEB 
CDYI»OGPOIljCDIiKrjflP5W»Vf?AirA AKTiT irnYRY*!r?Ptf3TT*rttrcT T.D 

GPYHRGKAQRNIYRLLNTAFGLVHLTAHSSLVCPI^LGGSLGI^ 
PE PP VS FPYI^YDATr.PFHCS AILATALDTVTCS \ Y RLCSS P VS 
MVHL\ADMLS FCGKKWTAGAI I PFPLAPGQSLPDSLMQFGGAX 
PWTPLSAOGBPSGTRCFAQSWLRGIDRACHTSQLTPGTPPPSA 
IJlACTTGF^ILAQYI^QQPGVMSSSin^LLTPCRVAPPYPHLF^ 
SCSPPGMVUDGSPKGAAVESVPVPG 


5730 


1258 


1713 


KKFQAPARETCVBCQKTVY PMERLLANQQVFHI S CFRCS YCNNK 
LSLGTYASLHGR1 YCKPHFNQLFKS KGNYDEGFGHRPHKDLWAT 
KIETEGFWERPR2TFEKC^RPIiKSPGGEDCPSC*GGCPGSNY*AQ 
GSSSRE KGGQAS WN PKliRVA 


5731 


122 


443 


RSHRGELIPKDSCTMRKPPRRJ>KKRRQG/CALPQGCIiTFKDVAI 

BFSLEEWKCLNPAQRALYRAVMIJ^YRNLK^ 

KPGRGRGKQRRQEWFFLRVY 


5732 


226 


772 


PPSRSCO^PRRKSRRRAlIVT^LVCXSFTSFSFSLPLYIiCGCM 
PERTreQIiOJADWAPDFGPSSFWSW^ \N 
liGTKEQAHR IAIJJIiREQGE GKDQPGRIiKKVCXJIGWYLJDEKNIiA 
QVSTNLU>PEVTALinVYEETC5REAQEI^IJPW 
ALLDAA 


5733 


1 


460 


PAI^BVNANAIJWGKQYEKDARTI^BFTSGVNIJrESP 1 1 YRDES~~ 
MRTACS PIX3LCSIXINGI1jELKCPFTSRDFMKFRLGGFEAI KSAYM 
AQVQYSMW VTRKNAWYF7WYDPRMKREGLnYVVI ERDEKYM \AS 
FDEI\VP\EFIGKMDEVLSRDPM 


5734 


3 


968 


RaaSPESLTSr^^^>LTTA^^FVl^PAYSK^fRAYAtFyIVFTVI ' 
GSLFLMNLLTAI I YSQFT^G YLMKSLQTSLFRRRLGTRAAFEVLS 
SWGEGGAPPO^VGyKP<^LI^VU)KVQIJ)SSHlCO^MBKVRSY 
GSVLLSAEEFQKLFNELDRSWKEHPPRPEYQSPFLQSAQFLFG 
IIYYFDYLGNIiI7\LAN]jVS I CVFLVLDADVLPAERDD FI IiGI LNC 
VFIVYYIiLEMLIJtWAIjGLRGYkS 
TL\VCTDCHTQAGGRlllW/RIJ^LWDMTRML^ 
SMKPMAWASTVLGL 


5735 


2 


540 


FFTPCVARAPNFPDQAl^GXAAYSJ^RVGGG 

ATPRQL YK/S SNMTQRWQRRE I SNFEYI^ PXOT-LRGRTYNDIiNQ 

YPVFPWVLTNYlSSEEIiDl/riiPGNFRDLSJ^ 

RYETWBDDQSPPYMYNTHYSTATSTLSWLVRrvS IFIBLACLWY 

LKILT 


5736 


1 


382 


GTRPSTKKSGYSPQQVAVXJJCKGHQKENTAVAHSNQKADSAAgV 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanxne, C=Cysteine, D=Aspaxtic Acid, E- 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, lfcLysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P-Proline, Q=01utamine, R=*Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyroaine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








TARLSVTPPNLLPTVS FPQPDLPDNPVYSTTTKKLASDIJUVNKN 
QES* * ILPDSGIPIP *T*TS YLQSTTHLRRAKLPQLLRR 


5737 


290 


1041 


KACLHLLSS FLTSNFLFNPLLPDSLYSVEARSQRANLGPCRRfCR 
LQTLMRI*AAG FQYSSHKDPSLSAKBKETDYHNEARGP WPGWVG * 
RTADGSCGRGPDGAHH PGPKS S SWRASRLLPGLGGS HHLDA YVG 
RDLECGTPAPLQIaEI PPQPRGHPAPIPTGQAGPRDSGPGASP*V 
ETRPIjTIX;RR* PGVRPVGWTPAHPAGTLRPRGAVKPSVSACGKW 
APS PTSQGCCEGRCDAVPKHRAWRTPLCSQ 


5738 


a 


460 


DTIiSLKCTLPETIfPMTPS P* IiS FL *FPGLARAKSIPTKTYSNE V 
VTLWYRP PD I DLGSTDYSTQI DMW *GQVRVWOGPCGKGGGLVTT 
ATQPAAFLPTVPSLPRGVGCIFYEMATGRPLPPGSTVEKQIiHFI 
FRILSEBAWALCAVBTHR 


5739 


1 


1222 


SPQRRGIRWNVHTLHPHPRAVWAGIGRGHGS * ALLGRARAPALC 
FPTLI^FLESI£PDI^AIJ2AMGIiHI^7WU5PGTHPAGISDI»IiABV 
SAEVIX»VPGYI£SPQSITDTCLYIFTSGTTG1^^ 
UKWFYQIXX5VHQEDVIYIALPLYI^ 

VLKSKFSAGQPWBD CQQIIRVTVFQ YIGKI*CR YLVNQP PS KAKRG 
HKVRIAVGSGLRPDTWERFVRRFGPr^VI,ETYGIjTEGNVATI]^ 
TGQRGAVGRASWLYKHI FPFSLIRYDVTTGEP IRDPQGHCMATS 
PGEPGIiLVAPVSQQSPFIiGYAGGPBIJUJGKIJJCDVFRPGDVFPN 
TRDLLVCDDQGFLRFHDRTGDP FRWKGENVATTBVAEVFEALD F 
LQEVNVYGVTV 


5740 


265 


231 


PAYWLKVPTU^^KTDIJIEK7^HVSAQI^EVRGIAGALWM*A 
YVYFJ^VYN*NISRfWHAl^QKRHPAGI^SSMAI^LNPCtiGMI/4A 
IjQSELHKLYDEETQSWVSGSACGGYP 


5741 


1 


650 


PRKTMRRGVLMTUjQQSAJMTLPLW IGKPGDRPP plcgai pasgd 

yvarpgdkvaarvkavdgdeqwii*aewsyshatnkyevddide 
egkerhtls rrr v i plpqw kanpbtdpealfqkeqlvial ypqt 
tcfyralxhappqrpqddysvlfelxrs yadgys pplnvaqryw 
acke pxkk * crlads ps pndtgqdsrgrag i kh i pplkkk 


5742 


2 


362 


TQSVKEILKRNPUVNLTDKIXSNTAJLMIASKEGHTEIVQDIxLDAG 
TYVNI PDRSGDTVItlGAVRGGHVE I VRAIiLQKYAD IDIRGQDNK 
TMiYWAVBKGKATMVRDILCK^NPX)TEICTKW 


5743 


2 


415 


GKTPEG I DA I EE IE IDLEETEREIS P QENGLE EVXPIjGEMQTDL 
KATGRKISPRBKTPEVIDATEEIDKDLKETGRRBISPEENGPEE 
VKPVDEMETDLKTTGREGSSREKTREVIDT^AEVIETDLEETERE 
ISPQE 


5744 


3 


703 


TRRj^A u l-i«P*rj , xRO>1TTrPAALPTTVVTT^ 

VFTTAim^LTPSTIiPF^TGLJjTPEPSK^ 

SDSWSSAESTSADTVIJjTSKESKVWDI»PSTSHVSMNKTSDSVSS 

P0PGAS I>TAVPEQNKTTKTGQMDGI PMsS MKNEMP ISQLLMI JAP 

SIXIFVLFALFVAFLLRGKI^BTYCSQK^ 

QHGREDEDGLFTL 


5745 


1400 


599 


GKSRFVNLMKHS KKTYDS FQDELEDY1 KVQKARGliEPKTCFRKM 
KGDYI*ETCGYKGEW3RPTYRMFDQRIiPSETIQTYPRSCNIPQT 
VE^bPQWI^AHDSRLRIiDSLSYC^FTRIX^EKPVPI^FNQQK 
Yra3SHGVEHRVYKHFSSDNSTSTHQASHKQlHQKRKRHPEEGR 
EKSEEBRSKHKRKKSCEEII>IJDKHK^IQRKKTEVEIETVHVSTK 
lU^KNRKEKKSRDWSKKBERKRTKiCKKEOGQERTEEEML I 
LGF 


5746 


3 


821 


SFASGPXTPSSPAFlXSELDIiQRYSWGPAVSAWSLGMGAVSWSES 
RAGERRFPCPVCGKRFRPNS ILALHLRTHQPERPRSPAARLLLE 
LEERAIJiREARI^PJvRSSGGMQAT PATFX5IARPQAPSS S AFRCP 
YCKGKFRTS AERERHLHI LHRPW KOGLCS FGSSQEEELLHHS LT 
AHGAPERPLAATSAAPPPQPQPQPPPQPEPRSVPQPEPEPQPER 
EATPTPAPAAPEBPPAPPEFRCX)VOGQS FTQSWFLKGHMRKHJCA 
SPDHACPV 


5747 


2 


1328 


DRHVETIiCIHPLGPSTGSTAKTGGRNWIJCTGN^ 

PSPRGKGYSSNYRRSPERPTGDIJIF^IKI^QDVDTEPQKRNTE 

BSSSPVRKESSRGRHREKBDIKITKERTPESEEENVEWETNRDD 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A*-- Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
HoHistidine, I-Isoleucine, K=Lysine, 
I»= Leucine , M=Methionine, N=^Asparagine, 
P=Proline, Q=Glut amine, R=Arginine , 
S=Serine, T=Threonine f V= Valine, 
W»Tryptophan, Y»Tyrosine, X- Unknown, *=Stap 
Oodon # /^possible nucleotide deletion, 
\=pogsible nucleotide insertion) 








SDNGDINYDYVHELSLEMKRQKIQRHLMKLKQENMEKREEI IIK 
KEVSPBWRSKIjSPSPSLRKSSKSPKRKSSPKSSSASKKDRKTS 
AVSSPLLD(K2U1^KINQSKXK6PRTPSPPPPIP£DIAI£KKYKE 
KYKVKDR I EEKTRDGKDRGRDFBRQRKKRDKPRSTS PAGQHHSP 
ISSRHHSSSSQSGSSIQRHSPSPRRKRTPSPSYQRTLTPPLRRS 
ASPYPSHSLSSPQRKQSPPRIIRSPMREKGRHDHBRTSQSHDRRH 
ERRKDTRGKRDREKDSREEREYBQDQSSSRDHRIIDRJBPRDGRDR 
RE 


5748 


934 


473 


SEGPQVPYKGLAPTLIAlFPyAGI^PSCYSSLKHLYKWAlPABG 
KKNEMLCjNLlX3GSGAGVISKTLTypiJ)IJP^ 
FGQVRRYKGLMDCAKQVLQKEGALGF F KGLS PS LLKAALSTGFM 
FFSYEFFCNVFHCMNRTASQR 


5749 


552 


1 


GPPVDPRWGSTLSLAKRPKGMIRSGSFRDPTDDVHGSVL.SLAS 
SASSTYSSAEBRMQSEQIRKLRREliBS SQRKVATLTSQLSANAN 
LVAAF^QSLWMTSRriRHIAETAHEKXJTKLLDLRF^DFLKKKN 
SRAQAVTQGALNASETTPKELRIKRQNSSDS ISSLNSITSHSSI 
GSSKDADA 


5750 


22 




IPISICIJ!*KAHIiCFIjIiPKIX:iIX}V>Da^ 

IH.BWAYVFLYYYBYRKAKDQLDIAKDI SQLQI DLTGALGKRTRF 
QENYVAQLI LDVRREGDVIjSNCEFTPAPTPQBHLTKNLEIiNDDT 
II^IKIADCEQFOMPDL(^BXAIII^ICTNFQKNNPVHT^ 
VEI»IAFTSail^PKPWAJG/rSAliIIJtTKLBKGSTO 
TQALADQPEOKTTSVIiERLKI FYCCQVP PHWAI QRQLASLLFEL 
GCTSSALQIFBKLEMWB 


5751 


3 


751 


SCGSAIiRAWRCGAAALAT FPAPALPGIiMYRALYAFRS AB PNALA 
FAAGETFLVLERSSAHWWLAARARSGETGYVPPAYIiRRJjO^LEQ 
DVIiQAIDRAI EAVHNTAMRDGGKYS IiBQRGVLQKL IHHRKETLS 
RRGPSASSVAVMTS STSDHHLDAAAARQPNGVCRAGFERQHSL.P 
SSEHLGADGGLFQI PLPSSQI PPQPRRAAPTTPPPPVKRRDREA 
LMASGSGGHNTMPSGGNSV5SGSSVSSCI 


5752 


3 


471 


GPVCGVGI^VAWAGPWRGPVHSVGGGGRAALHGAKTiPCLSGAAT 
VKHEMELRHKNKMLRVtrrBARARAKAERENADI IREQIRLKASE 
HRffn^SIRTAGTLFGEOFRAFVTDRDKVTATVNl FIKC^WQ V 
AERQHVGASWSPRSCPCRLCCAI. 


5753 


34 


483 


DDSXAI PGGVQAP FGAVRN I YT PRTGHRI RKIiDQ I QSGGN YVAG 
GQEAFKKLNYLDIGE I KKRPMBVVKTTEVKPVIHSRIIJVSARFRK 
PLQEPCTI FIiIANGnLI^PASRLLIPRICIXNOWDHVL^^ 
TLRSGAVHRLYTLEGRLV 


5754 


14 


331 


TLVHWE FAGBHAEAXASREQBVI^X3WKEIiI>SACEDARLHVSST 

ADALRFHSQVRDLLSWM1X3IA^ 

WPTPATPSPLTAPFSMB 


■ 5755 


3 


888 


LGDQFYKEAIKHCRS YNSRIXMRSVRLPFLIWJ'GVAG^NCY I 

WMEKRHRGPGIAPGQLYTYPARCWPJOXRRLH^ I K 

PBVFXPLKKIX3PTSESTTIjEAI>IjRGFXJVEKKVDAREEE5IQEIQ 

RVLENDENVEEGNEEEDLEED I PKRKNRTRGRARGSAGGRRRHD 

AASOjBDHDKPWCB'IOGKRYKNRPGljSYH 

DQBTRSPPNHRNENHRPQKGPDGTVXPNNYCDFajGGSH^ 

GRPEELVSCADCGRSAHLGGKGRKEKEAAA 


5756 


3 


621 


SS KLQALFAHPLYNVPESPP LLGAEDSIuaASQEALR Y YRRKVAR 
WNRRHKMYRHQMNLTSI»DPPI^LRX.E7\SVfVQF;{IjCINRHGLYSR 
SS PWSKLIX^WRHFPTISTU^YSOJ^EkaIXGACDCTQIVKPSGV 
HLKLVLRFSDFGKAMFKPMRQQRDEETPVDFFYFIDFQRHNAEI 
AAFHLDRILDPRRVPPTVGR rVNVTKEII» 


5757 


3 


473 


YKDAJLJOLPDNHRQXArFENGTI^TDVQKGNIDEGEYIjCSVLIQPQ 
LSISQSVHVAVKVPPLIQPFEFPPASIGQLIiYIPCWSSGDMPI 
RITWRKDGQVlISGSGVTIESKBFMSSLQISSVSLKHl«Om , CI 
ASNAAATVSRRRQLIVRVPPRFW 


5758 


1 


474 


FRRGAGAERGEHREGERGAAGMGBFKVHRWFFNYW£K3IRCVA 
YMJQSNRLAVSR^nX5T\reiYNLSANYFQBKPFPGlffi 
WAEGQRLFSAGLNGB IMEYDLQALNT KYAMDAFGGP I WS MAAS P 
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NO: 


Pre dieted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of. 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to Eirst 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«Alanine, C=Cysteine, D^AspartiC Acid, E» 
Glutamic Acid, F= Phenyl alanine, G-Glycine, 
H^Histidine, I^lsoleucine, K^Lysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline f Q^Glut amine, R=Arginine , 
S=Serine, T«Threonine, V~Valine, 
WaTryptophan, Y«Tyxosine, X=Dnknowti, *=Stop 
Codon, /^possible nucleotide deletion , 
\=possible nucleotide insertion) 










5753 


2 


1240 


GNAAFAGQGV VYETFHMSDLPS YTTNGTVH^ 

MARSSPY PTDVARVVTtfAP IFHVNADDP EAVI YVCS VAAK WRKT F 

NKPVGADLVCYRRRGHNEMDB PMFTQPLMYKQ IHRQVPVLKKYA 

DKLIABOTVTI^EFEEEIAKYDRTCTRAYGRSKDKKILHIKHWL 

DSPWPGFFNVDGEPKSMTCPATGI PEDMLTHIGSVASSVPIiEDF 

KXHTGI^RILRGRADMTKNRTTO^^ 

MGQDVERGTFSHREEIV1»HIX3SEVDRRTCT 

S S I»S E YG VLG FKLGYAMA3 PNALVLWEAQ FGDFHNTAQ C 1 1 DQF 
I STGQAKWVRHNGIVLLLPHGMEGMGPBHS SARPERFLQMSNDD 


576C 


1 


1221 


VRDI TSDSLS LSWTVP EGQ PDKFLVQFKNGDGQP KAVR VPGHBD 
GVTISGLEPDHKYKMNLYGFHGGQRVGPVSAYGLTAPGKDEEMA 
PASTEPPTPEP?IKPRIiEELTVTDATPDSI*SLSWTVPEGQFDHF 
LVQYKNGIX3QPKA111VPGHEDRVTISGLBPDNKYKMNLYGFHGG 
GRVGPVSAIGVTAAEEETPTPTRPSMEAPEPPEEPIXGELTVTG 
SSPDSI>SI^WTVP<X5RFI)SFTVQYia)RDGRPQVVRVGGBESEVT 
VGGLEPGRKYXMHLYGIJIEGRRVGPVSTVGVTAPQEDVDETPSP 
TEPGTEAPBPPBE PUiGELTVTGSSPDSLSLSWTVPQGRFDS FT 
VQYKDRIX3RP0AVRVGGQKSKVTVRGI»BPGRKYKMHLYGIiHEGR 
RLGPVSAIGVT 


57(il 


3 


1275 


SCDMAEAAAIATW IRGPGFGCKAVRCASGRCTVRDFIHHHCQDON 

VPVEOTPVKOfGAIiIlWSDTVQHGAV^^ 

RAIiGAQIBKl iNRKACRDI^GKI^ 

AEKEQKRLERLQRKLVE PKHCFTSPDYQQQCHEMAKRLEDSVIjK 
GMQAAS S KMVSAE I SENRKRQWPTKSQTDRGAS AGKRRCPWIiGM 
EGLETAEG6NSESSDDDSKEAPSTSGMGFHAPKIGSNGVEMAAK 

KHMESRHVTETEETQEKKAESKEP lEEEPTGAGLNKDKBTEERT 
IX^VAEVAPEERENVAVAKLQI^PGimVIUKETIDLLAFTSV 
AELEMiGLEKLKCELMALG LKCGGTIX} 


5762 


2 


344 


MSSEEAANGKKSHWAEJUE ISGKVRSLSASLWSLTHLTALHI>SDN 
SLSR1PSDIAKLHNLVYI.BLSSNKIR 


5763 


3 


129 


LDKDTGLIMIjIARIiDYEilQRFTIjTI 
IiDVNDNVPTFQKnAY^KSALPJ^ 

ITYSrVSASAFGSYFDISLYEGYGVISVSRPLDYEQISNGLIYL 
TVMAMDAGK 




1 Q 


441 


VCARACGBHRQLJjRP idrqrydenedlsdvebivsvrgfslbek 

LRSQLYO^DFVHAMBGKDFNYBYAFQREAIJtVPI»I FREKDGljGI K 
MPDPDFT^DVKlJiVGSR^VDVMDVl^KGTS^HSQFVRYYE 
TPBAQRDKL 


5765 


3 


825 


QKILRI^NSHQPPTSSSNSKDCGGPASSGAGATAALADGLKFAS 
VQ AS APQGNS HKETS KS KVKRS KTS KUAN ICS L?SAAL YGI PE I S 
STGKRQBVQGRPGEATGMNSAIiGQSVSSGGSGNPNSNSTSTSTS 
AATAGAGSCGKSKEEKPGKSOSSRGAKRDKDAGKSRKDKHDLLQ 
GUQNGSGSO^PSGGHLYGP^AKSNGGGASPPHCGGTGSGSVAAA 
GBVSKSAPDSGIllGNSWI^VKXBEEEEBSHRRIIGa^KTEKVDPLF 
TVPAPPPHV 


5766 


1608 


663 


SGLFSVDPASSQAMKLSDVTLI EGVGNKVMWAGWVLIIiALVL 
AMLSTYVADSGSNQLI^AIVSAGiyrSVLin^GHVDHLVAGQGNPE 
PTELPHPSEGNDEKAEEAGEGRGDSTGEAGAGGGVEPSLEHLLD 
lO^I^KROAGAGSSSPEAPl^BDSTCE^PSPGLITVRLKFI^rD 
TEELAVARPEDTVGALKS KYFPGQESQMKLI YQGRLLQDPARTIj 

rslniti^cvihchrsppgsavrcpsasijvpsateppslgvnvg 
slmvpvfvvllgvvwyfrinyrofft^atvsl 

FGHYGR 


5767 


2 


892 


NFRATPRPPTRPF^RTGTEVILWYIJWRAI^ 

SG FPL PSSDLDDS LTEE I DEKIGFRin?ANFl>WQNVADFRnAGGS 

LTEVKVBBEERDPQSPBFBIEBEBEMLSSVIPDSRRENEI»PDFP 
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NO: 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
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Predicted end 
nucleotide 
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corresponding 
to first 
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residue of 
amino acid 
sequence 


*wu»ud auiu segment coiiLaining signal peptide 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E=* 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
L— I>eucine , M=*Methionine, N=^Asparagine f 
P^Proline, Q=Glutamine, R»Arginine, 
S=Serine, T-Threonine, V=Valine, 
W^Tryptophan, Y-Tyxosine, x^onknown, *=*stop 
Cod on., /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








HIDKt'Kr^STPSRSAYDEPHLLVNIEKQKLELEKHRLDXEABK 
LQVEKERLQlEKE^RHLDMKHERLQLBKERLQIERE KLRLQI V 
NSEKPSLKNELGQGEKSMI^PQDIBTKKIiia^RERJjQIiEKDRLQ 
FIiKFESBKLQ IEKERLQ VEKDRLR I QKEGHTkQ 


5768 


3 


476 


SSRSRLSVSVSPPPPGIVBLGPPFAWEFCSRLGSAVTSQRAGPA 
AAMVAKDYPFYLTVKRANC^LEIiPPASGPAKDABEPSNKRVKPL 
SRVTSLANBIPPVKATPLIO^SQTLQRSISFRSBSRPDILAPRP 


5769 


38 


6*7 


TKVKKG VKE KATDQS VKAFAEIIC PEIjQ YVGFMGCSVTS KG VIHL 
TKIJ^NLSSIJ^LRHITELDNETA^ 

DRCVEVIAKEGQNLKELYl.VSCKITDYALIArGRYSMTIETVDV 
GWCKKITIXKiATblAQSSKSLRYLGLMRCDKVNEVTVEQLVQQY 

trill Ifol VJjV/xJLIVICI JbrSKAIUraj'W i tfvimU\t\^tc» 


5770 


1 


484 


DSRRYDVKTRKWSFI1LEEHSKLIAKVRCLPQVQLDPLPTTLTI1A 
|? -*»W A » R Jvi»>i*5>Jul IrJJ VFlS/UJJjbisVUFKJj vbWljMt 1 ryK/tOVNcAI 
AKGGRLLLADDMGIiGKTIQAICIAAFYRJCEWPLLVVVPSSVRFT 
WBOAFbRWLPSLSPDCINVVVTGKDRIiXA 


5771 


168 


741 


GLLPSACLRARS WREAS EGPSSRACSNGSQDTFKACTY SGTSTPS 
FHC^HCSGSDHSSU3I*BQLQDYMVTI*RSKIX5PI£IQQPAMLLRE 
YRIiGI«PIQD YCTGLLKLYGIWRKFI*IJiGMRPFI PDQDIGYFBGF 
i»&v* vv# XKsfcfoi i» 1 Lib rfcrKl KRSM5STSASAVRS YDGAAQRPEAQ 
APHRLLADITHDIB 


5772 


148 


383 


EFNI»ALVSPSHPQIKAEDDQPLPGVliI^I>SGGLFRSNLI,TQDNG 
ILTFSNLVTCSAIYHLPVFPEREPGCSMRDLRVA 


5773 


2 


723 


PRVRS KHNFC FMEMNTRLQ VEHP Vl'KM I TGTDLVEWQLR IAAGE 
RJ.f l»i>UEKi XliOGHAFEARIl AEDPSNNFMP VAGPLVHLSTPRA 
DPS TRI ETGVRQGDEVS VH YDPM I AKXVVWAADRQAALTKLRYS 
LRQYNIVGLirmiDFLLNLSGHPE FEAGNVHTDFI PQHHKQLLL 
SRKAAAKESLCQAALGLILKEKAMTDTF J TIK3AHI5QFSPF5SSSG 
RRLNISYTRNMTLKDGKNSK 


5774 


2 




pvfilSriU IK v VRCU\jblSljNr RKAVFSADSKYIFCVSGDFVKVYST 
VTEECVHII^GHRNLVrciQLNPNNHI^LYSCSIiDGTIKLWDYl 
DGILI VGCKI^IjFTIJVQAEDS VFVIVNKEKPDIFQI.VSV 
KLPKSSSQRVBAKELSFVLDY INQS PKCIAFGNBGVYVAAVREF 
YIiSVYFFinCETT*; PVFT.QQ <5 

* V J 4. f AAA JltpnVl 1»M 


5775 


3 


538 


SSGCCDPAAPSSIJVEAATMPVSKCPKKSBSLWKGWDRKAQRNGL ' 


RSOVYAVNGDYYVGEWKBNVXB^ 

RIOTGTI^r.PDQQTGKCRRVYSGWHKJGDKKSGYGIQFFGPKEYY 
BGDWCGSQR3GWGRMYYSNGD I YEGQWENDKPNGKGMTiRI*SQNP 
RP 


5776 


2 


484 


RLPQDCVCX^^BSI^GTLCPSKGI^ 

I IHISRQDPANMTGLVDLTLSRNTI^ttQPFSFI^l^SIjRSIiHL 
I)SNRIJ?SJjGBDTLRGLVNLQHLIVHKNQ1jGGIADEAFEDFLIjTIi 
EDLDIjS YNNIiHG PAVGLRGDAWQPS TS 


5777 


2 | 


949 


GQDPEPGQDLFQPEREVDPSWGRGREPRLGKIjRFQNDHIjSVLKQ 
VKIOuEQM.KIX^AGLDPQLPGTCYSPHCPPDKAFJ^STLPENIX? 

ggsgswsqr\mpsdlex3rbptpelvedpjcgscrrpwdrsi*env 
yrgsegsptk^finplpkprrtftoiagegdkdgkpgigfrkekr 
nlpplpslpppplpsspppssvnrrlwtgrqks3adhrksyefe 

DLLQSSSF^SRVDWYAQTIO^LTRTI^EENVYEDILDPPMKBNP 
YEDIELHGRCU?KECVI^PASPTSSI PDTLTKQSIiSKI>AFFRQ 
NS2RRNV 


5778 


1 


1210 


QRRQSVSRLLLPVFUjBPPAE PGLEPP PEEEGGEPAGVAEEPGS 
GGPCWLQLKEVPGPGPLGGGGPLRSPSSYSSDELSPGEPLTSPP 
WAPLGAPERPETOLNRVLERLAGGATRDSAASDILLDDIVIjTHS 
LFLPTEKFliQEl^QYFVRAGGMEGpE<3X5R^ 
YQGLLQEBBGAGH 1 1 KDLYLIiIMKDESIjYQGLRBDTLRIiHQLVE 
TVFJjKlPBEl^PPSKQVKPLFPJBFRRIDSCliG^rRVAFRGSDEIF 
CRVYMPDHS YVTIRSRLS ASVQDI LGSVTEJCLQ YSEEPAGREDS 
L ILVAVS S SGE KVLLQ PTEDCVFTALGI NSHLFACTRDS YEALV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
Ht=Histidine, I^Isoleucine, Lysine, 
L= Leucine, M^Methionine, N=Asparagine, 
P^Proline, O-Glutamine, R=JVrginine, 
S= Serine, IVThreonine, V= Valine, 
W-Tryptophan, Y=Tyrosine, X= Unknown, *=stop 
Codon, /"possible nucleotide deletion, 
\=possible nucleotide insertion) 








PLPKEIQVSPGDTEIHRVEPEDVANHXTAFHHBLFRCVH3LEFV 
DYVFHGB 


5773 


138 


1671 


EAVQV LI KHS ADVNAHDKNWQTPLHVAAAN KAVKCABVI I PLLS 
SVNVSDRGGRTALHKAATiNGHVFTWNI^ 

ALH WAAYMGHIiD WALL INHGAE VTCKDK1CGYTPLHAAASNGQ I 
NWKHLLNLGVE I DK I NVYGNTALH IACYKGQDAVVNELIDYGA 
NVNQPNNNGFTPLK^AAASTHGALC^ 

S PIJMFAVHGRFTRSOT1.IONGGE IDC?VTMCDRTflTPT.Tn7a ad WiW 
BI^OTLITSGADTaKCGIHSMFPIiHLAALNATO 
QKYSIVSLPSNEHVI^AGFBIiyrPDKFGRTCLHAAAAGGNVECl 
KLLQSSGADPHKKDKCGRTPLHYAAANCHFHCIETLVTTGANVN 
ETDDWGI*TALHYAAASDMDRNKT I LGNAHDNS EELBRARELKEK 
EATLCIjSFLLQNDANPS IRDKEGYNS IHYAAAYGHRQCLELLLE 
RTNSGF3ESDSGATKS PLHLAVSEMP 


S780 


154 


624 


QFPRVITCLPPKGPDYRLYKSEPBLTTVAEVDBSNGEBKSEPVS 

DLRTERPRSAVKQI/XAESTRPRMTVEEQMBRIRRHCQACLREK 
KKCJLNVIGASDQSPLQS PSNLRDNP 


5781 


19 


941 


rgslgghpwrp.pmraasc^cijPvsfvtgphqbray;g^gpggaf'' ' 

pappvsgtcppdl i yaptpbkak3gsqknhqpppgbhsaahrdgb 

g^pcragptrkvavappjpscp*gpe\pgeeprrpldrspplgq 

vqphftsqdakeaedeapsrhlgkhqprsaqvgsrl^^ 

qhsihtvtcksprqkerrspkppojftpkhpkbhgrq3\qappplp 

vapsrtcx3gc*twdpallvsp/pqgdstpelpap\qqptggpsr 

croalppqg*rqqprqrpr/ptgasrshpakakgcqgppkirliy 

NIMD 


5782 


5176 


1237 


DRSI^MAADSYTDSYTDTYTEAYMVP?tPPfifipPTMPPLPPEE 

PPMTPPLPPEEPPEGPALPTEQSALTAENTWPTEVPSLPSEESV 

SQPKPPVSQSEISEPSAVPTDYSVSASDPSVLVSEAAVTVPEPP 

PEPESS ITLTPVESAWABEHEWPBRPVTCWVSETPAMSAEPT 

VIASEP PVMSBTAETFDSMRASGHVASEV3TSLLWAVTTPVLA 

BSII^PPAMAAPESSAMAVIiESSAVTVLESSTVTVLB^ 

EPSWTVPEPPWAEPDYVTI PVPWSALBPSVFVLEPAVSVLQ 

PSMIVSEPSVSVQBSTVTVSEPAVTVSEGyiXJVIPTEVAIESTPM 

ILESSIMSSHWKGINLSSGDQNLMEIGMQEIALHSGEEPHAE 

EHLKGDFYESEHGINIDLNINNHLIAKEMEHNWCAAGTSPVGE 

IGSEKILPTSETKQRTVUDTYPGVSEADAGETLSSTGPPALBPD 

ATG\TSKGI3FTTASTI^LVNKYDVDLSLTT(»)TEHDMLISTSP 

SGGSEADIEGPLPAKDIHO)LPSNII^V3SDTTrePLPVKRD\DQ 

TLAALI\SLKESSGGEKBVPPP3*REHLPD5GFSANIED1NEAD 

LVRPVSSPRTIWVLPSPRAGL\BGP\LIASDFGPVQNLYSSPVV 

\SSMP\ERASGS\SSGENGG\ YEI FVKVKDTHE3CSKKNKNRDKG 

EKEKKRJDSSLRSRSKRSKBSEHKSPJaTSESRSRARKRSSKSKS 

HRS\Cn'RSRSRS/RDRPJ^SRSRSKSRGTUlSVSKEKRKRSPKH 

RSKSRERiCRKRSSSRDNRKTVRARSRTPSPJtBRSHTPSRRRilSR 

S VGRRRS FS IS PSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRR3 

RTPSRRSRTPSRRERSRSVVRRRSFSISPVRLRRSRTPLRRRFS 

RSPIRRKRSRSSERGRSPKRLTDLDKAQLLEXAKANAAAMCAKA 

GVPLPPNLKPAPPPTIEBKVAKECSGGATIEHLTBKCKQIAQSKE 

DDDVIVNKPHVSDBEEEEPPFYHHPFKIjSBPKPIFPNLNIAAAK 

PTP PKS QVTLTKEFPVSSGSQHRKKEADSVYGEWVPVEKNGBEN 

KDDDNVFS SiJLPSEPVDI STAMS ERALAQKRLSENAFDLEAMS M 

LNRAQE RI DAWAQUv S I PGQFTGSTGVQ VLTQEQLANTGAQAW 1 

KJCDQFLRAAPVTGGMGAVL^^ 

DFKTDRKGLVAVGERAQKRSGOTSAAMKBLSGKHPVSALMEICN 
KRRWQP PE FLLVHDSGPDHRKHFLFRVLING6AYQPNCMFFLNR 
Y 


5783 


1693 


698 


IISGLRVAFTMEGI^NFKTPS KLSEKXKSVLCSTPTItf I PASPFM 
QKLGFGTGVNVYLMKRSPRGLSIIS PHAVKKINPICNDHYRS VYQ 
KlU^EAKILK5LHHPNIVGYRAPTEAOTGSlXn^EYGGEKBL 
NEOBB/PI*SQ/PKILFO£P/LILKVAI^^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepto.de 
(A=Alanlne, C= Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G -Glycine, 
H=Histidine, I^Isoleucine, K-Lysine, 
L=Leucine, M=Meth±onine, N-Aaparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S-Serine, T=Threonine, V=»Valine, 
W^Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LHGDIKSSNVVIKGDFBTI KICDVGVSLPLDBNMTVTDPBACYI 
GTBPWKPKBAVEBNGVTTDKADIPAFGI*TIjWQ®n , LSIPHINIiS 
ITODDDEDKTPDESDFDDKAYYAAIjGTRPPINMEBLDESYQKVIE 
LFSVCTNBDPKDRPSAAHIVBALBTDV 


5784 


26S$ 


1388 


PRVRPRVRTDHNYYI SRI YGPSDSASRDLWVNIDQME KDKVKIH 
GILSNTIlRQAARVNLSFDFPFYGHFIiRKITVATGGFIYTGEVVH 
RM LTATQ Y I APIJ^AN FDPS VS RNSTVRYFDNGTAL WQWDHVH L 
QDNYNLGS FTPQATLLMDGRI I FG YKB I PVLVTQ I SSTNHPVKV 
GLSDAFVVVHRIQQIPNVRRRTIYFYHRVBLQMSKITNISAVBM 
TPLPTCLQFNRCGPCV3SQIGFNCSWCSKLQ^CSSGFDRHR0DW 
VDSGCFBESKBKMCENTEPVET\FI£PPQP*3RQPPSSGS*IJ?P 
E/EAVTSQFPTSLPTEDDrKIAI^LKDNGASTDDSAAEKKGGTL 
HAGLIVGILILVLIVATAILVTVYMYHHPTSAASIFF1KRRPSR 
WPAMKFRRGSGHPAYAEVEPVGBKBGFIVSRQC 


5765 


2669 


1388 


PRVRPRVRTDHNYYISRIYGPSDSASRDLWVNIDQMEKDKVKIH 
GI bSNTHRQAARVNLS FD FPFYGHFLRB I TVATGGF I YTGEWH 
imLTAXX}YIAPLMANPT)PSVSRNSTVRYFDNGTALVVQWDHVHIi 
QDNYNI/5S FTFQATLLMBGR 1 1 FG YKE I PVLVTQI SSTNHPVKV 
GLSDAFVVVHRIGXJIPEIVIUmTIYEYHRVKIiQMSKITWlSAVHM 
TPLPTCXQFNRCX5PCVSSQIGFNCSWCSKLQRCSSGFDRHRQDW 

vdsgcpeeskekmckntkpvkt\fi,eppqp*erqppsscs*lpp 

E/DAVT^FPTSLPTEDimCIALHr»KDNGASTDDSAAEKKGGTL 
HAGLIVGI LI LVLIVATAILVTVYMYHHPTSAAS IFFIERRPSR 
WPAMKFRRGSGHPAYAEVEPVGEKEGFIVSBQC 


S78^ 


25*2 


1674 


SYKLPAAERRASSCSQPPTPTRRRWPAPGRTSRGHRPQM*SGTP 
APRPPARSTVSPASPLPKPRAGRCGSRPRSACSTFRPC*SIiN*M 
S * H * KRNLSQRS S SMSRRPkS CARPHR* * RQGI*TVAARLPTW AK 
SPPLACSFCOAAGKSQSLSSGRSTR* PERMS FRP\SPPGNPAIP 
SLAPSSRP/PKGRPQCTWIPSRWPASPTAPPTTT*APTSSPGST 
GRSMMTCPTRWTATPWSARASSRPRNWPTP * WRPSGRLSTV* RA 
TGGSTATAPPKRFPRNWNFMMAE 


5787 


2 


1460 


MAS AAS VTS LADEVNCP \ I CQGTLKBAGSLSNCG/ HKNFCRACL 

T\RYCTIP\GPD\LBBSP\TCP\l.CKEPFRP\GSFRPNWQIiAW 

VENIERIiQLVSTrKSLGBEDVCQEHGEKIYFFCEDDEMQIXOTCR 

EAGEHATHTHRFLEnAA\APYREQlKJCCIiKCl»IKBRK]SIQEIQS 

RBNKRMQVtiLTQVSTKRG£VISJ3FAH^^ 

QDGDI LRQRDE PDLLVAGEI CRFSALI EBLEBKNERPARBLLTD 

IRSTLIRCETRKCRKPVAVS PELGQRIRDFPQQALPLOREMKMF 

U3KLCFEIJ3YBPAHISIjDPGT£HPK1^ 

PDNPQRFDRATCVLAHTGI TGGRHTWW3IDLAHGGSCTVGWS 
EI^RKGEIiRLRPEEGVWAVRIAWGFVSALGSFP\TRr*TLKEQP 
RQVRVSLDYBVGWVTFTNAVTRBPIYTFTASFTRKVIPFFGIiWG 
RGSSFSLSS 


5788 


2 


6860 


EUSVSGRSSAYGDATABGHPAGPGSVSSSTGAISTTTGHQEGDG 

SEGHUKGKTEGDVUTSNRLHMVRLWLLKIITJ^TLPQLRNVOGVR 

AIPYMQVTI^LTTDJblXSF^EKDKGALl^^ 

SKKNERSAIJ!ffiVIILVVmT.LSVFMSRTKSGSKSSICESSSLISS 

ATAAAIil>SSGAVDYCUT^KSLIiBYWKSQC»roEKPVATSQLLKP 

HTTSSPPDMSPFFLROYVKGIIAADVFEAYTXJLLTEMVT.RLPYQI 

KKJTDTNSRIPPPVFDHSWFYFLSEYl,MrQ(rrPFVRRQVRKLLI> 

FICESKEKYRQLRDIJrrLDS\HVRGIKK^^ 

SP03ALQYDTLISLMEIILKACAEIAAQRTINWQKFCIKDDSVLY 

FIJjOySFI»VDEGVSPVr^MiS(^CGSKVl^^ 

SSPAPVAASSGQATTQSKSSTKKSKKEEKEKEKDGETSGSQEDQ 

UT17*LVN0J^NKFADKBTLrQP^ 

IYI^SKSQQEIJ,rj)I^SIWPEI^AYGRJCAAQ^ 

TPQTB KKLKEY5QKAVEI LR TQNH ILTNH PNSNI YNTLSGLVEF 

DGYYLESDPCLVCNNPEVPFCYI Kt^SIKVDTRYTTTQQVVKIiI 

GS HTIS KVTVKIGDLKRTKMVRT I NL YYNNRTVQAIVELKNKP A 

RWHKAKKVQLTPGOTEVKXDLPLP IVASNLMr BFADFYENYQAS 

TBTLQCPRCSASVPAKPGVCXMCGBWVYQCHKCRS INYDEKDPF 
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ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, OCysteine, D«Aspartic Acid, E=i 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=His t i dine , I = Isol eucine , K^Lysine , 
L=Leucine, M=Methionine, N=Asparagine , 
P=jproline, Q=Glut amine , R=Arginine , 
S=Serine, T^Threonine, V»Valine, 
{^Tryptophan, Y=Tyroaine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 








LCmCG^CKYARFDFMLYAKPCCAVDPIKNEBDRKKAVSNlNTL 
LDKADRVTHQI^GHRPQIjENLLCKVNEAAPEKPODDSGTAGGIS 
STS ASVNRYI LQLAOE YOGDCKNS PBELSKI I QKVFASRKELLE 
YDLQQRRAATKS SRTS VQPTFTASQYRALS VLGOGHTS S TKCYG 
CASAVTFJTCITLLRAIATNPAIJmiLVS^ 
AAMREEVRQLM(^LTRDNPEATQQMNDI>I IGXVSTAUKGHKANP 
DIiASSLQYEMLIiLTDS I SKEDSCHELEtLRCAIiSLFLMAVNI KTP 
VWKNITIMa^RII^KLIK^PAPTSIOCNKDVPVEALTTVKPY^ 
KIHAQAQLW IxKRD P K AS Y DAWK KC L P I RG I DGNGKAP S ICS KLRH 
LYL>TBKYVWRWKQFI^RRGXRTSPIJ)LKLGHNNWLRQVLFTPAT 
QAARQAACT IVEALATI PSRKQQVLDLLTS YLDBLSIAGECAAK 
YLALYQKLZTSAHWKVYIAARGVLPYVGNIjITKEIARTiTiATiHRA 
TLSTDLQQGYALKSLTGLI^SFVEVESIKRKPKSRLVGTVIJ^ 
LCLRKLWQRTRTi I DBTQI)MIjI^MIjEDMTTGTE5ETKAFMAVCI 
ETAKR YNliDD YRTP VF I FERLCSI IYPBBNKVTBFFVTIiEKDPQ 
QBDFLQGRMPGNPYSSKEPGIGPLMRDIKNKI GQDCDLVAIiLED 
DSGMEIiLVNNKIISLDLPVAEVYKKVWCTTNF^KPMR 
LLGDATE E FT E SLBSTTDEEEDEBEVYKMAGVMAQCX3GLE CMLN 
PJ^IRDFKQGRHlJiTVI^KIjFSYCVKyKVNRQQLVK^ 
VMIiGTLNLAL VAEQKS KDSGGAAVAEQVLS I ME I \ IQAE PNVEP 
LSEDKGNLIJLTGDKDQLVMLr 1 DQINSTFVRSNP5iVLCGI»LRIlP 
YLS FGEVEKMQI LVBRFXP YCNFDKYDEI)HSGDDKVFL\D CFCK 
IAAG I K\NNSNGHQL\ KDL \ ILQKG I TQNALD\ YMKKHI P/SAA 
RIWDADI \WK5FCLRPALPFILRLLRGLAIQHPGTQVLIGTDSI 

RF/TRAEKKRKAMAMRQKALGTLG \ MTTMEKGQWD/TRTALLEA 

DWEEIjX BaBP\GI*TOCICREGYKFQPTKVLGI YTFTKRWLGGVW 

FJTOREI'SRATSTVSIIFNI VIIYDC\HLA\AVS LARGREBWESAA 

LQNANTKCNGLLPVVK3PHVPESAFATC1j7VRHHTYIiQEC1GQRKP 

TYQLNIHDIKLLFXiRFAMEQSFSADTGGGGRBSNIHLIPYI IHT 

GIjYVLNTTRATSIIBEKNIjQGFLEQPKEKWVESAFK^ 

LAl^I LPPEQWRATRVBIIJUIIiLVTSOARAVAPGGATRLTDKAV 

KDYSAYRSSIJ.FWALVDLIYHMFKKVPTSNTBGGWSC5I1AEYIR 

HNDt^IYEAADKALICTFQKEFMPVETFSEFIJTVAGrj^EITDPB 
SFLKDLLNSVP 


5789 


1 


2407 


I^I^VEKTGRPGQPALKMPGKI^DAGLE^DTAMKXGETLRKO 
TBElOSKKBfCPKSDXTBEIAEEBBTVFPKAKQviuu^PSEVD^ 
SPKSKKAKK\KERPSQNDISPKTK£LRKKKEPIEKKVVSGKTKK 
VTKNEBPSEBEIDAPKPKKMKKBKEMNGl^ntBICS PKLKNGFPHP 
EPDCNPSEAASEESNSEIBQEIPVEQKEG\AFSNFPISEETIKL 
LKGRGVTFLFP 1QAKTFHHVYSGKDLI AQARTGTGICr^SFAIPL 

I EKLHG\ EIjQDRKPjGRAPQ vlviaptre lanqvskdfsditkki* 
svacfyggtpyggqfermrngi dilvgtpgrikdhiqngkldlt 
iclnhvvij)bvix2mli>mgfadq veeim va ykm>s bdnpqtllfs 
atcphwvfnvakkymkstyeqvdl^ kch 

WTQRAAVIGDVIRVYSGEIQGRTI I FCKTKKEAQBLSQMSAI KQD 
AQSLHGD I P QKQRBlTIiKGFRNGS FGVLVATNV7AARGLDIPEVD 
LVIQSS PPKDVESYIHR^RTGRAGRTGVCICFYQHKBEYQLVQ 
VEQKAG I K FKRIGVPSATE 1 1 KASSKnAI RLlLDSVPPTAISHFK 
QSAEKLIEEKGAVFJUAAAI^ISGATSVDQRSLINSmraF 
II^CSIEMPNISYAWKBLKEQI^EEIDSKVKGfWFLKGKDSVCF 
DVPTASVTEIQEICWHTJSRRWQLSVATEQPELEGPREGYGGFRGQ 
REGSRGFRGQRDGNRRFRGQREGSRGPRGQRSGGGNKSNRSQNK 
GQKRSFSKAFGQ 


5790 


3786 


1585 


ARRQRDPLOAlJiRRNQELKOOVDSLLSESQLKSALEPNKRQHiy 
QRCI QLKQ AI DENKN ALQKL»S KADESAP VANYNQRKBEEHTIiLD 
KLTQ0LQGIJVVTISRB19XTBVGAPT6EEBES ESEDSEDSGGEER 
DAEEEEBEBOBBNBSHKWSTGEEY IAVGDFTAQQVGDLTFKKI3E I 
LLVIEKJCPDGWWIAKDAKGNF^LVPRrYLEPYSEEEEGQESSEE 
GSEBDVEAVDETADGAEVX\QRTDPHWSAVQKAJSEAGI FCLiVN 
HVS FC^LIVIiMRNRMF/rVEIJrNGSETGFRAWNVQSRGRIF^ 



372 



WO 01/53312 



PCT/US00/34263 



1 SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location, 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C-Cysteine, D-Aspartic Acid. E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
h*> Leucine, M=Methionine, N-Asparagine, 
P^Proline, Q^Glutamine, RaArginine, 
S=Serine, T=Threonine, V»Valine, 
tt-Tryptophan, Y=»Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PVLQQl^TVDVLTTMGAIP 

PEX^PSQIiAJf'RDLMWDATEGTIRSRPSRISLILTLWSCKMIPLP 

GMS IQVLSRHVRLCLFTX3NKV1^NIBTVRAT^ P 

QVTRILPCLLDGDCFIRSNSASPDlJGILFEIiGISYIRNSTGBRG 

EI>8C5SWVFLKLFDASGVPIPAKTYELFLNGGTPyE^ 

SRRAHGSVFYQIMTMRRQPQLLVKLRSLNRRSR^ 

GNMCSIHIiLIFYRQILGl)VIiLKDR 

LEQPDVMDALRSSWAGX}ES\^ 

\GC^PI»L/HTPTRLPPF^WAEBB1^ARWKVITDFIJCQNQ 
GALQALLSPIXJVIIBPFDI^EO/ITDFIjGEMRKNAV 


5791 


3 


1636 


LRVAEFAGTSR/ IGAGLIQPIHRAPARDHGIiIjRGGAAPAIjSVSH 
GN/ GKQL/AMSSQQSDDEQI KRENI RSLTMSGHVGFESLPDQIjV 
NRS I QQGFCFNII*CVGETG IG KS TG I DXIj fntnfe d ye s s hfcp 
NVKIiKAQTYELQBSNVQLKLTIVimraFGDQINKEESYQPIVDy 
IDAQFEAYLQBELKI KRSLFTYHDSRIHVCI*YFISPTGHSLKTL 
DLLTMKNIiDSKVYII P VI AKADTVS KTELQKFKIKLNSELVSNG 
VQI YQFPTDDIXriAKVNAAMNGQLPPAVVGSMDEVKVGNKMVKA 
RQY PWGVVQVENEKHCDFVKLREML I CTNMKDLREQTHTRIIYEL 
YRR<^a^EMGPTDVGPENKPVSVQETYKAKRHEPBGERQ 
MKQM FVQRVKEKEAILKEAERBLQAKFEHLKSUIQEERMKLEEK 
RRLLEEEI I afskkhatsei FH3QS FIATGSHLRKDKDRKNSQF 
FVKQKVPEHRRSSSQANFIKKKLEVGFDFAVTCFITSIFGEQPQ 
LLIFMEKYFQVQGQYISQSE 


5792 


2263 


653 


AAAAPSPAWWqSVFVVYVVHTCWVMYGIVYTRPCSGDASCIQFY " 
LARRPKI<JL\RHSF^TTRSHI/?AENNIDLVLNVEDF0VESKFER 
TVNVS VPKKTSNNGTL YAYI FLHHAGVLP WHDGKQVHLVS PLTT 
YMVPKPEEimxLTGBSDTO^IEADKKPl'SALDEPVSHWRPRLAL 
NVMADNFVPDGS8LPADVHRYMKMI QLGKTVHYLPILFIDQLSN 
RVKDLMVIIIRSTTEIjPIiTV'S YDKVS LGRIiRFW IHMQDAVYSIiQQ 
FGFSEKDADEVKGIFVDTNLYFIiALTFFVA^ 
ISFWKKKKSMIGMSTKAVLWRCFSTWIPLPLLDEQTSIiVLVP 
AGVGAAI KLWKVKKALKMT I FWRGLMPEFQFGTYS ESERKTEE Y 
IHX)AMKYIiSYIiL YPLCVGGAVYSIJ^NT KYKS WYS WLINS FVHGV 
YAFGFLFMLPQLFVNYKJLKSVAHLPWKAF^ 
IITMPTSHP^CFRDDVVFLVYLYQRWDYPVDKRRVNEFGESYE 
BKATRAPHTD 


5793 


2263 


653 


AAAAPSPAVWCGVFVVYVVHTCWVMYGIVYTRPCSGnASCI 

IiARRPiOOL\RHSFTTTRSHIiGAKNNIDLVLKVEDFDVESK^ 

TVNVSVPKKTRNNGTr,YAYI FLHHTVGVLPWHDGKQVHLVSPLTT 

YMVPKPEEINIXTGESDTQQIEADKICPTSALDEPVSHWRPRI^ 

NVMAENFVFT«SSLPADVHRYMKMIQtt3K^^ 

R VKDIJ4VlNRSTTBLPIiU^S YDKVSIX3RLRFW I HMQDAVYSLQQ 

FGFSBKDADEVKGI FVDTNLYFLALTFFVAAFHLXiFD FIiAFKND 

ISFWKKJOCSMIOMSTKAVIiVniCFS^rVVIFLFIiLDF^ 

AGVGAAI ELWKVTClGUJCMTIFWRGIiMPEFQFGT^ 

VTQAMKTLSYLhYPhCVGGAVYSliLtllEYKSVlYS WL INSFVNGV 

YAFGFI«PHLPX)LFVNYKIJCSvAHLPWKAFTY 

1 1 TMPTSHRIACPRDD VVFLVYI>YQRWL YPVX)KRRVNEF^ 

EKATRAPHTD 


5794 


1 


SQ16 


MGPRLSVWLI^PAAIil^HEEHSRAAAKGGCAGSGCX5KaX^GV 
KGQKGKRGrjPGIXX^GFPGMQGPFX5PQGPPGQKGDTGEPGI,PG 
TKGTRGPPGASGYPGNPGLPGXPGQDGPPGPPSIPGCNGTKGER 
GPLGPPGLPGFAGNPGPPGLPGMKGDPGE I LGHVPGMLLKGERG 
FPGIPGTPGPPGLPGI^PVGPPGFTGPPGPPGPPGPPGEKGOM 
GIiS FGG PKGDKGDQGVS GPPG VPGQAQVQE KGDFATKGEKGQKG 
EPGFQGMPGVGBKGEPGKPGPKGKPGKDGUKGEKGSPGFP3BPG 
YPGLIGRQGP\OGEKGEAGPPGPPGIVIGTGPIjGEKGERGYPGT 

pgprgepgpkgfpglpgqpgppglpvpgqagapgfpgergekgd 
rgfpgtslpgpsgrdglpgppgspgppgqpgytngivecqpgpp 
gdqgppgipgqpgfigbigekgqkgesclicdiugyrgppgpqg 
ppgeigfpgqpgakgdrgi*pgrdg^mgvpgpqgctgi*i 
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Amino acid segment containing signal peptide 
(A=Alanine, C^Cysteine. D^Aspartic Acid, E=. 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K>Lysine, 
L=Leucine, M=Methionine , N^Asparagine , 
P^Proline, Q=Glutamine, R=Arginine, 
S«Serine, ^Threonine, V= Valine, 
W=Tryptophan, Y=Tyro3ine, X^Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








GBPGEFYFDLRLKGDKGDPGFPGQPGMJ^RAGSPGRDGHPGLPG 
PKGSPGSVGLKGBRGPPCGVGFPGSRGDTGPPGPPGYGPAGPIG 
DKGQAGFPGGPGSPGLPGPKGE PGKIVPLPGPPGAEGIiPGS PGF 
PGPQGDRGFPGTPGR\ PGL\PGEKGAVG\QPGIGFPGPPGPKGV 
DGLPGD^PPGTPGRPGFITCLPGNPGVQGQKGEPGVGLPGLKGL 
PGLPG I PGTPGEKGS IGVPGVPGEHGAIGPPGLQGIRGE PGP PG 
LPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPG 
FlH3IJDMPGPircDKGAQGLPGITGQSGLPGLPGO^X5APGIPGFPG 
SKGEMGVMGTPGQPGSPGPHGAPGLPGEKGD\HGFPGSSGPRGD 
PGLKGDKGDVGLPGKPGSMDKVYMGSMKGQKGDQGBKGQIGPTG 
EKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPG I SGTPGAPGLP 


GPK93VGGMGIiPGTPGBKQVPGIPGPQGSPGLPGDKGM0GEICGQ 
AGPPGIGIPGLRGSKGDQGIAGFPGSPGBKGEKGSIGIPGMPGS 
PGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGI PGVRSEAGLPG 
TPGPTGPAGQKGEPGSDG I PGSAGEKGEPGLPGRG FPGFPGAKG 
DKGS KGEVGPPGLAGSPG I PGSKGBQGFMGPPGPQGQPGLPGS P 
GRAT RGP Fmni? fi Pftfin Pn t . vv~z r .wz DMn o dtct . on t rtnvKTiTYirrnjo 

gwpgapgvpgpkgdpgfqgmpgiggspgitgskgdkgppgvpgf 
^pkglpglqgikgdo^ixjgv'pgakglpgppgppgpydiikgep 
glpgpkgppglkgu^lpgpkgqqgvtglvgipgppgipgfdga 
pgqkgemgpagptgprgfpgppgpdglpgsmgppgtpsvdhgfl 
vtrhsqtrddpqcpsgtki lyhgysllyvqgnerahgqdlgtag 

ITGKNIRPFISRCAVCEAPAMVMAVHSQTIQIPPCPSGWSSLWI 
GYSFVMHTSAGAEGSGQ7VLAfiPGSCLBEFRSAPFIECHGRGTCN 
YYANAYSFWIATIERSEMFKJCPTPSTLKAGELRTHVSRCQVCMR 
RT 


579S 


1192 


61 


STRSPTVBYISAHPHIIjFMLLKGYEAPQIALRCGIMLRECIRHK 
PIAKI ILFSNQFPwDFFK^ELSrFDIASDAFATFKDLLTRIlKVL 
VADFTiEQN YDTI FBDYEKLLQS2NYVTK11QSLKLLGELTLDRHN 
FAT MTK Y I S KPENLKLMMNLIJ^KSPN IQFEAFHVFKVFVA6 PH 
KTQPIVKILLKNQPKIjIEFIjSSFOKERTDDEQFADEKNYIiIKQI 
RDIiKKTAP * RALRDS KR 


5796 


2 


1078 


GRVGWEIiWCMYISPPKDWWDAGDPSI»P irtpamigcs FWNDRKF 

TOEIGIiDPGMDVYGGEiaELGrKVWLCGGSMEVLPC^RVAHIE 

RJ3GCPYNSNIGFYTKRNALRVAEVW^DYKSHVYIAWN^ 

GIDIGDVSERRALRKSLKCKNFQWYtiDHVl^ 

I»RHNKAKWCbDQGPI£^AtLYP43^^ 

GALGTrTIJ*PDrmCLVmSK5RLP<}I^ 

NGAIMNKX5TGRCIjF^ElTOGIiAGIDLXUlSCTGQRWTIRW3rK*R 
EGAGAI^PGPQIX*iAAPPNlWTSCPGGETARGRQVIiDGPPRASPG 
QHRDPG 


5797 


2 


891 


PRVRQKTLVDVTLBNSNIKIWRNIi^ 
VAQVBNQLLKMKVESSQKANABVM^^ 

KHSAE KKAI.I»EETNS FLKAIEEANKKMQAAE I SLEBKDQRI GEL 
DRLIERMEFOiRHQLQLQLLEHETEMSGBliTDSDKERYQQLEEAS 
AS LRERIRHLNDMVIICQQKKVKOMVEEIES LKKKLQQKQLL IliQ 
IxLEKISFljEGKNJJEljQSRI^YIiTETQAiCTB 
SQTCRTRBIVMPSRKYTPYTRVLELTMKKTLT 


5798 


644 


115 


KITCSRWKSMSNQEKQP YYEEQARLS KIHLEKYPNYKYKPRPKR 
TC TVDGKKLR I G3 YKQLMRSRRQKMRQ FFTVG QQPQI P I TTGTG 
VVYPGAIT^TTTPSPQMTSDCSSTSA^PEPSLPVIQSTYGMKT 
IX^SIAGNEMINGEDEMEMYDDYBDDPKSDYSSENEAPEAVSAN 


5799 


2679 


1435 


LLSTYIKFINLFPETKATIQGV1RAGSQLRNADVELQQRAVEYL 
TLSSVASTDVLATVIJSBMPPFPERBSSILAK^ 
DDGRRDPSSNDINGGMBPTPSTVSTPSPSADLLGLRAAPPPAAP 
PASAGAGNLLVDVFDGPAAQPSLGPTPEEAFLSPGPEDIGPPI P 
EADBLLNKFVCKNNGVL FKNQLLQ I GVXSEFRQNIjGRMYIxFYGN 
KTSVOFQNFS PTWHPGDLQTOIAVQTKRVAAQ^ 
NIECLRDFLTPPLLSVRFRYGGAPOALTLKLPVTINKFFQPTEM 
AAOJDFFQRWKQI^LPQQEAQKIFKANHPMDABVTKAKLI^ 
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Amino acid segment containing signal peptide 
(As Alanine, C— Cysteine , D^Aspartic Acid, B— 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H^Hiotidine, I~Isoleucine, K«Lysine, 
L=Leucinc, MoMethionine, N-Asparagine , 
P=Proline, Q=Glutatnine r R=Arginine, 
S=Serine, T=Threonine , V=Valine # 
W»Tryptophan, Y- Tyrosine, x=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=pos3ible nucleotide insertion) 








LLDNVDPNPENFVGAG I IQTKALQVGCLLRl^PNAOAQMYRLTL 


5800 


2673 


1435 


LLSTYIKFINLFPETKAXIQGVIJlftGSQLRNADVBLC^ 
TLSSVA5TDVLATVLESMPPFPERESS ILAlGiKRKKGPGAGSAL 
DDGRRDPSSNDINGGMSPTPSTVSTPSPSADIXGLRAAPPPAAP 
PASAGAGNLLVDVFDGPAAQPSLGPTPKBAPr^PGPEDIGPPIP 
EADEIJ^KFVCK^GVLFENQIJ^IGVKSKFRQNI J GRMYI J I- r YCN 
KTSVQFQNPS PTVVHPGDIiG/rQIiAV(yrKRVAAQVDGGAQVQC^ 
NIEC^RDFliTPPLI^VRFRYGGAPQALTLKLPVTINKPPQPTEM 
AAQDFFORWKObSLPQQBAQKI FKANHPMnABVTKAKLIiGFGSA 
LLDNVDPNPENFVGAG 1 1 QTKALQVGCLLRLE PNAQAQMYRLTL 
RTSKEPVSRHLCEIiLAQQF 


5801 


3 


1413 


FPRLYHliIPDGElTSIKIKIRVDPSBSLSlRt.VGGSE'IPIiVHI 1 1 
UHIYRrX5VIARlX5RLLPGDIILKVN(^ISNVPHNYAVRLLRQP 
CQ VLWLTVMREQKFRS RNNGQAPDAYR PRDDS FHV1 LNKS S PEE 
QLG I KI..VRKVDEPGVPI FNVIJX^ VAYRHQQLiEEliuRVJjA NUn 
DLRYGSPESAAHL I QAS ERRVH1»WSRQVRQRS PDI FQEAGWNS 
NGSWSPGPGKRSNTPKPIiHPT itchekwm iqkdpge SLGMTVA 
GGASIIREiTOLPrYVISVBPGGVISRIX3RIKa^DII^VDGVEI>T 
KVSRSKAVAIxLKRTSSS IVLKALEVKEYE PQEDCSS PAALDSNH 
NMAPPSDWSPSWVMWLBLPR^YNCXDIVIMNTAGSI^Ft:^ 
GYEEYNGNKPFFI KS IVEGTPAYNDGR IRCGD ILLAVNGR STSG 
HIHACLARLIJCELKGRITLTIVSWPGTFL 


5802 


3 


— 

290 


Cfc"5 Jb YUirilSKXMUJjtrrJLiljKMAlfKlSMl^a VUCiitimnJfK J.K J.X JUULiPl 
GAFFYIiISPLDFVPEALFGIIiGFLDDFFVIFLIiLiIYI S IMYREV 
ITQRJjTR 


5803 


2234 


1299 


BAQFGTTAK I YAYREEQDFG I K I VKVKAIGRQRFKVLELRTQSD 
GI03AKVQILP3CVLPSTMSAVQLESLNKCQIFPSKPVSREDQC 
SYKWWQKYQKRJCFHCANLTSWPRWLYSLYIJABTIiMDRIK 
WDENLKDDSLPSNPIDFSYRVAACLPIDDVLRIQIjLKTGSAIQR 
UlCEriDIMNKCraLCCKO^ETEITTKNBIFSIiSIiCGPMAAYVN 
PHGYVHBTIiTVYKACMiNLIQRPSTEHSWFPGYAMTVA^ 
SHIGWKFTATKKDKS PQKFWGLTRSALLPT1 PDTEDE I S PDKVI 
Ld» ' 


5804 


2 


1707 


EMS' &QRQEKQRKRTE E RRKRRI EQDMLEKRK1 QRRLAKRAEQ IE 
DIlTOTGrESASEEGDDSMiITWPVKSYXrSGKMKKNFEDLBKE 

RBBKfiKXKi tsKUtU< JLKXJuSUKiro 

BSLS PGKLKLTFEEIJSRQRQQTOiaCO^BEBAlUQlLBBBKRAFBE 

ARRQMVNEDEBNQDTAKI FKGYRPGKLKLS FEEMERQRREDEKR 

KABEBAROTIF^EKKAFAEARRNf4VVDDl^PEMYKTISQEFLTP 

GKIiEINFEELLKQKMEEEKRRTBEERKHKT,EMKKQKFRQr,RQFiM 

GEKEKRNE-rFG^R^YEELIKLKRSGSIQAKNLKSICFEKIGQLS 

ETTRTQKKTRFEff ARFP ATPF' R TKFRBAIjn}fFHKFT>T>VTWRPAR 

EAPFTHKVNMKARFEQMAKAREBEEG^IE^QKIiRMQFEQREI 

DAALQKKREBEEEEEGS IMNGSTABDEEQTRSGAPWFKKPLKNT 

SVVDSEPWFTVTCVTGEPKPBI'rWWFEGEILQDGEDYQYIERGB 

TYCLYLPETFPEDGGEYMCKAVNNKGSAASTCILTIESKN 


5805 


3 


776 


YISDTLGC^KSKIRWWIKKNGGNGNISVDDLiIALIaDIiAEHASS 
AFKESC^QS BDREYE VKERE YPKS KRR YDTYNIAGYQGE I EVGL 
YTIQILQLI PFFDNKNELSKRY14VNFVSGSSDIPGDPNNEYKLA 
LFmiPYLTKiKFSLKKSFDFFDEYFVrJ^Rl^IKG^^AKTR 
RKVAGYFTOCYVDIFCIiLEE3QNJtnX3LGSKFSBPLQVE^ 
ALKADKFSGELEYLIKSQEDAISTMKCIVNEYTFLriK 


5806 


1257 


877 


AVFrKBO^GRTANLYSLIlSHLGI'rTVFIiFACXJRFLGFAVFLT.PW 
ASMWLRS LLKP IHVFFGAAILSLS IAS VI SGINBKLFFS I»KNTT 
RPYHS LPSEAVFANSTGMLWAFGLLVLYIIiTiAS SWKRP 


5807 


2267 


1302 


PJ'SKKTFTWPMAVDIQPAC^LYCGKTU.FKNGSTKIYGECGVC 
PRGQRTNAQKYCQ PCTBS PB LYDWLYEGFMAMLPL VLHWF F I EH 
YSGKKSSSALFQHITALFECSMAAIITLLVSDPVGVIiYIRSCRV 
rmSDWYTMLYNPSPDYVTTVHCTIIEAVYPLYTIVFIYYAFCIiV 
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Amino acid segment containing signal peptide 
(A*? Alanine, C= Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Hisfcidine, I— Isoleucirxe, K= Lysine, 
L=Leucine, M=Methionine, N^Asparagine , 
P=Proline, Q=Glutamine, R-Arginine , 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y- Tyrosine , X=Hnknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\ ^possible nucleotide insertion) 








I>1MLLRPliI»VKKIACGLGKSDRFKS I YAALYFFP I LTVLQAVGG 
GLLYYAFPYI ILVLSI.VTLAVYMSASK1 ENC YDLLVR KKRLI VL 
FSHWLLHAYG IIS ISRVDKLE QPLPI«IjALVPTPALFYLFTAKFT 

epsrilsegangii 


5808 


2 


433 


SLPDSGVVBYI^NGGVADNHKDFGBI^YWECLMNFSCNGKNGSS 
EGRITHGFQLKSAYEBINIjMPYTNYTFDPKGV^ 
IiGVtJBPLDPQV^VEiraiTGCPHPHIPSDHFSIiLTQLELHPPLLP 
LVNGVHLPNRR 


5809 


464 


2422 


ILVPGFQGILHPGVYCAIX3SQHQAQELV7u^IDECEVSGr,CRHGG 
RCVNTHGSFECTCMIXJYl^RNGPEPFHPTT^ 
EVPDGYIIGNYTSSIXJSQVRYACREGFFSVPEDTVSS CTGIjGTW 
ES P KLHCQE INCX3WP PEMRHAI LVGNHS SRLGGVAR Y VCQEGFE 
S PGGKITSVCTEIKGTWRESTLTCTEILTKINDVSLFinyrCVRWQ 
INSRPONPKISxVISIKGQRU)PMESVRHETVNt,TTDSRTPEVC 
LAL YPGTNYTVNISTAPPRRSM PAVI G FOTAEVDLLEDDGS FKI 
S IFNBTCLKI^RRSRKVGSB^lYQFWX^RVm^PSHATSPN 
FTTREQVPWCI^IiYPTTOYTVHVTIiLRS P KRHS VQI T IATP PA 
VKO^SNISGFNETCl^WRSIXTAD^^ 

AQEMTFNISSSSP^PEVCTiDLRPGTNYNVSLRAr^SSELPWISL 
TTQI TBPPLPEVEFFTVHRGPLPRIiRLRKAKEKNGP IS S YQVLV 
LPlJU^TFSCC^EG/LSSFFSNASDAIX^YVAAEIJJUaTVPDnAM 
HI PIGDRL YYGEYYNAPLKRGSDYCI ILRI TSEWNKVRRIIS CAV 
HAQVKDSS I*MIXQMAGVGLGSLAVVI ILTFLS FSAV 


5810 


3 


1641 


KVPGTHKDHBV5TLDTAI S AVKVQLAEFLENLQEKSIiRIEAFVS 
E IBS FFNT I EENCS KNE KRIiEEQNEEMMKKVLAQ YDB KAQS FEE 
VKKKKMBFLHEQMVHFLQSMDTAKDTI£TIVREAEEIiDEATO 
SFEEINERLLSAMESTASLEKNPAAFSLFEHYDD3SARSDQMLK 
QVAVPQPPRLE PQEPNS ATSTT IAV YWSMNKEDVTDS FQVYCME 
EPQDDQEVMELVEEYRLTVKES YCI FEDLEPDRC YQVWVMAVN F 
TGCS LPSERAI FRTAPSTP VI RAED CTVCWNTAT I RWR PTTPEA 
T ETY TLKY CRQ1 15 PEGEGLRS PSG I KGLQLKVNI>QPNDN Y F FYV 
RAIKAFGTSEQSEAALISTRGTRFLUiRETAHPALinsSSaTVI 
SFGBRRRI»TKlPSVLGBEl,PSCGQHYWB*rTVT^^ 
SAVQAGALGQGETSWYMHCSEPQRYTFFYSGIVSDVHVTERPAR 

KCTLHLGIBPPDSVRHK 


5811 


1918 


851 


AAAIJU0PLPEDKWSABKPJlPrJCSSIX5YEiTFSLIJ*PDPKSHDVY 

WDIEGAVRRYVQPFtNALGAAGaiFSVDS^ 

ASSSYYLDMHSI^RViCNPVESttl^SSAASLYPV^ 

K^PLYIQDKIX^VATNAniSPEWGOIMVYNVDSKTYNA3VI,PV 

RVEVl>!VRVMBVFiAQLPJLLTOlAQPQLPPKClVLS 

WELDRLLWARS VENLATATTTLTS LAQLLGKISN I VIKDDVAS E 

VYKAVAAVQKSAEKUVSGHLASAFVASQEAVTSSELAFFDPSLL 

HTJ.YPPDDQKFAI YI PLFLPMAVPI LLS LVKI FLETRKSWRKPE 

KID 


5812 


5204 


2744 


GGRQRCQPJSRSCGAREEEVEPGTARPPPAASAMDASLEKIADFr 
IiAEMGKNLKEAVTCMliEDSKiRRTEBEN^ 
^SII^VQNIdlHGDBDBEPQSPRIQ^^ 
YISTLDKEiGJlia^TTRILSDTTLWlX^FRYENGCAYFHBEERE 
GIAKICTRl^IHSRYEBFVVDGFKVI.YNKKPVIYLSAAARPGLGQ 
YL CHQLGbPFPCLCRVP CNTVFGSQHQMDVAFLEKL I KDDI ERG 
RLPLI^VANAGTAAVGHTDKIGRLKEIX!EQYG I WLKVEXXVITLAT 
LAI/SYVSSSVLAAAKCDSMTMTFGPI^ 

LTLVAGLl^NKPTDKLRALPLWI^LQYLGI^FVERrKHACQLS 
QPJ^ESIJCKVNYIKILVEDELSSPVVVFRFFQELPGSDPVFKAV 
PVPN^PSGVGRi^HSCDAlaNRWLGEOLKQLVPASGbTVMnT.EA 
EGTCLRFSFLMTAAVLG1TIGEDVW 

EEFKQEVFATAGLLYVnDPNWSGIGVVRYEHANDDKSSLKSYPQ 
Gl^IHAGI^KKIiNEI^SDLTFKIGPEYKSMKSCLYVGMASDNVH 
AAELVET IAATARE I EDNSRLLENMTEWRKGIQEAQVELQKAS 
BEPXLBEGVLRQIPVVl^VtJWFSP^ 
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Amino acid segment containing signal peptide 
(A« Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F— Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Lsucine, M=Methionine, N^Asparagine , 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V»Valine, 
W-Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=po3sible nucleotide insertion) 








TBPIYVYKAQGAGVTLPPTPSGSRTKQRLPGQKPFKRSLRGfiDA 
LSJKTSSVSHIBDLEKVEHLSSGPEQITLEASSTEGHPGAPSPQH 
TDQTEAFQKGVPHPEDDHSQVEGPESLR 


5813 


2936 


699 


HRDGVSGSLERPLTDRS RTGAFAQQRGKMATAGGGS GAD PGSRG 

IJiRI»I»SFCVIJOAGLCRGNSVBRKrYIPI^^ 

GCQSSISGIXK^IHVVBKEKDX^WVLTDGPWPPYMVLL 

RDLMBKLKGRTSRIAGLAVSIiTKPS PASGFSPSVQCPNDGFGVY 

SNSYGPEFAHCRKIQWNSLGNGIAYEDFSFPIFIiLEDENfeTKVI 

KQCYQDHNI^NGSAPTPPI*3U^IJ>S^ 

SIQS TFS INPKI VCDPLSDYNVWSMLKPINTTGTLKPDDRVWA 

ATRLDSRSFFWJV\APGABSAVASPVTQLAAAEAIjQKAPDVTTI» 

PRNVM PVFF^ETFDyiGSSRMVYDMEKGKFPVQIjENVBSFVET, 

GQVAIiRTSIiRliWMHTDP VS QKNBS VRNQVEDLLATLB KSGAGVP 

AVILRRPNQSQPLPPSSLQRFLRARNISGWJU^HSGAFHNKYY 

O^IYDTAENIl^YPBWI^PI J KE/ETNNFG*QD^ 

IX5RALYEIJUXmi FSDTV<^PCyrVTRIXYG\ FLIKANNSWFQS 

IUK5^I^YLG*RGLFQH\YIAV\SSP0OTIW/VI^YALANL 

TCTVVNLTREQCX)DPSKVPSENKDLYEY£WQGPIJISHETDRI.P 

RC^STARLARAL^PAFBI^QWSSTEYSTWTESRHKDIRARIFIj 

IAS KELELITLTVGFG I LI FSLIVTYCIIJAKADVXjFIAPREPGA 

VSY 


5814 


8500 

I 


432 


ALKCR PRRVIjAI LVGPVQPDRMAE EGAVAVCVRVRPLNSRKESIj 
GETAQVYWKTHNNVI YPVDGS KS FNFDRVLHGNETPKNVYEA\ I 
AAPI TDSAIQGYNGTIFA\YGQT\ASGKTYTMMGSEDHiGVIPQ 
GQ FHGHFSQKI * EVFLDRE FLI^VS YMEIYNBTI TOLL CGTQKM 
KPLI IRBDVNRNVYVADLTBEVVYTSEMAI.KH ITKGEKSRHYGE 
TKMNQRSSRSHTIFPJIII^SREKGEPSNCEGSVKVSHLNLVDLA 
GSERAAQTQAAGVRLKEG<^INRSLFir/WIKKI>SIXX3VGGFI 
NYRDSKLTRILQNSLGGNPKTRIICTITPVSFDErLTAI.QFAST 
AKYMKNTPYVNEVSTDE^IAKRYRKBIMDr»KKQLKKVSLBTRAQ 
AMKKDQLAQLLE EKDbLQKVQNBKl ENLTRMLVTSSSLTLQQ3L 
KAKRKRRVTWCLGKINKMKNSNYADQFN I PTNITTKTHKLS INL 
LRE1 DES VCSESDVFSNTI*DTL3EI BWNPATKLLNQEN I ES ELN 
SLPJu^YDNLVIJJYEQLRTEKEEMEliKIiKBKKDL^^ 
KIKJEMQLIHE ISNIiKNLVKHREVYNQDLSNELSS KVELLREKED 
QIKKLQEYIDSQKLENIKMDLS YSLES IEDPKQMKQTIiFDAETV 
ALBAKRB SAFLRS ENI*ELKEKMKKLATTYKIJMKiro IQLYQSQIiB 
AKKKMQVDLBKELQS AFNB ITKI»TSL I DGKVPKDLLCNL£IiEGK 
lTDLQRELmKVKBNKATJ?«RVIU^ELKSLPSEVERI>RKKTQD 
KSEELKI ITSEKDKLFSEVVHKESRVQGtLEBIGKTKDDLATTQ 
SNYTCSTDQRPQN FKTLI IMDFKQKYKMVLEENKRMNQE I KP, 
AQK^DSSLGA]^TBI>SYKTQEI^EKTREV^ERLNKMEQLKEQIJC 
NRDSPLQTVERE KTLI TE^CLQQTLEEVKTLTQEKDDLKQLfQE S I» 
Q IBRDQLKSDIHDTVNMNIDTQEQtjRliAJjE 
KISEEVSRNLHMEEWTGETKDEFQQKMVGIDKKQDLKAK1WQTL 
TADVKDNE I IEQQRKI FSLIOEKNELQQMLESV I AEKEQLKTDL 
KENIEMTI ENQEELR JjI^DBLKXGXJErVAOEKNHAI KKEGBLSR 
TCDRLAEVEEKIJCBKSQQIiQEKOQQUOT^ 
NLKNELKHKELTLEHMETERLELAQKIlfENYEEVKS ITKBRKVL 
KELQKS FETBRDHLRG Y I RE I BATGLQTKEEIiKIAH I HLKEHQE 
TIDBLRRSVSBKTAQI INTQDLEKS FfTKLOKE I PVLHEEQELLP 
KVKKVSETQETMNEIiEIiLTEOSTTKDSTTIAR IEMERLRLNBKF 
QESQEEIKSLTKBRDNLKTIKBAJb^^ 

SQSKQEO^LNMKElQONETTKIVSEMEQFKPXDSALIiR IEI EMLG 
r^KRLQESHDEMKSA^^KDDI^PXQBVLQSESDQLKENIKEIV 
AKK1JSTE EBLKVAHCCLKEQEBTIKEIjRVK1*S BKE^ 
LBArNDKLQNKIQEIYEKEEQLNIKQISBVQEKVNBLKQFKEHR 

kakdsalqsirskmleltnrlqesqkeiqimikekeemkrvqea 
ixjie3dqijcehtkbrwakmkbsqekbyqfi>kmtav^ 

IBHLKEQFETQKiOT^IETENIRLTQILHFJILEEIffi 
DLRSVEETUareRDQIiKEHUlOT^ 
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Amino acid segment containing signal peptide 
{A=Alanine, C^Cys teine, D-Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
Ifc=Histidine, loisoleucine, K=Lysine , 
L^Leucine, M=Methionine, N=Asparagine , 
P^Proline, Q= Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W-Tryptophan, Y-Tyrosine, X=Un)cnown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








QBTIDKIiRGIVSBKTNEISMMQKDIjEHSIIDAIiKAQDIiKIQBBLR 
IAHMHLKBQQBTIDKi^IVSEKTO^ 

IQKEiKAIJEHQLITIiKKDVNKTQKKVSEMEQLKXQIKDQSLTLSK 

I£IENLMIAQKUIEinU3EMKSVM^ 

ESLQETKARDLBIQQELKIIARMLSKEER^^ 

ISDlQKDLDKSimEIiQKKIQEr^KKEMl^VKEDVIWSHKKIN 

EMEQLKKQFEPNYLCKCEMDNFQLTKKLHESLEEIRIVAKERDK 

LRRIKBSbKMSRDQFIATLREMIARDRQNHQVKPEKRIXSDGQQ 

HLMESLREKCSRI KBLIJCRYSEMDDHYECLNRLSI.DLEKEI E FH 

RIMKKLKYVLSYVTKIKEEQHBCXNKFEHDFIDEVBKQKELIiIK 

IQHIOQDCDVPSREIJu^LKI^QNMDLHIEEILKDFSESEFPaiK 

NNFFNNRIIAXMNESTEFEERSATISKEWEQDLKSLKEKNEKLF 
KNYQTLKTSIiASC^QVNPTTQDNKNPHVTSRATQLTTE KIRELE 
NSLHEAKES AMHKB SKI I KMQKELEVTUD 1 I AKLQAKVHESNKC 
LEKTKKT I QVLQDKVALGAKP YKEE I EDLKMKLGK I DLEKMKNA 
KEFEKEISATKATVBYQKKVIRIXRFJILRRSQOAQiyrSVTSEHT 
DPQPSNKPLTCGGGSGI VQNTKALI LKS EH I RLEKEI SKLKQQN 
BQLI KQKNELLSNNQHLSNEVKTWKERTLKREAHKQVTCENSPK 
SPKVTGTASKKKQrrPSQCKERNIiQDPVPKESPKSCFFDSRSKS 
LPSPHPVRYFE«SSLGLCPBVQNAGAESVDSQP\GPHARLFQGK 
DVP\ECKTQ 




23 


1460 


S BI>VMWTVQNRES I£LLS FPVMITMVCCAHSTHEPSNMS Y VKET 
VDRLLKGYDIRIiRPDFGGPPVDVGMRI DVASIDMVSEVNMDYTL 
TMYFQQSWKDK31I»SYSGIPL^T^ 

S FVHGVTVKNRM I RLH PDGTVL YGLR I TTTAACMMDLRRY PLDE 
CNCTLEIF^YGYTTDDIEFYWNGGBGAVTGVNKIEIiPQFSIVDY 
KMVSKKVEFTTGAYPRr^LSFRLKRNIGYFILQTYMPSTLITIL 

C?W\7t2 K*WTRTVT>7\ C!R AT5'l7'IlT./3T'T»'TOrr "PWITTTCTTrr flpTT DlfTBVl/y 

anv&clVJXtxl/AiMMwVAlJtfl l 1 VLiirll 1 JL«a JLHwKJai-J-ilrKJ.x'X VK 

AID I YLMGC FVFVFLALUS YAFVNY I F FGKGPQKKGAS KQDQSA 
NEKNKLEMNKVQVDAHGNI LLSTLEIRNETSGSEVLTSVSDPKA 
TMYSYDSAS I QYRKPLS S RE \A*GRAP3RHGVPS KGR I RRRAS \ 
QLKVKIPDLTDVNSIDKWSRMFFPITFSLFNWYWLYYVH 


5816 


861 


191 


TSSRSRAAAQEG^AETPGSVERRGRRAGAEDGMSQAPGAQPSPP 
TVYHKROPT.T^^JVVHAT.NllVT^innT»P<;ftPAAnn TPTTPT Mm qdt. 

NPimSLI^TCNYDVNVIMAAI^X^IiAAVWWDRRRPr^OIALPQ 
VLGLILNLPS PVSIiGLLSLPI.RRRHLRWPCARr,/ VTV5YYNLD3 
K\LRAPEGPGGLRTE\*GPFIiAAALAQGI/^VIiLVVrKE^ 
SHLRTD 


5817 


851 


118 


RLFRGPGANRGRSCRGCSGGREPSGGA1,PKRHCPC*PPSPPAAD 
VMSNTTVPNAPC^SDSMVGYVl^F 

KKRVDRI>RHHIJJ>MYSxT)PAEEUffiAEQErj^DMGDPKVV\QAG 
RVATSTSGCECWMSRRIXLiTPLPHPSBPGVLDCl^PCHI^^ P 
GSPCWVIX^FSiaPPSAASA3HALTITSLPPGLLPFVt3VELTA 
HPQALMGRGF PSGMAAAGRHLCFI* 


5818 


3 


3918 


QALKDKLWIFLVQS FYAVRHTES WfaMSTDDQQKI QAAAFDKGD 
DRRLGXKP IFSS SQQPJCQVSDSGDIKI KS WRGNNKKECWSYLST 
NKKMKSDGLGASGHSSSTNRNS INKTLKQDDVKEKDGTKIASK I 
TXELKTGGKNVSGKPKTVTKSKTENGDKARLENMS PRQWERSA 
TAAAAA^KNLI^GKGVRNQEGQISGARPK^ 
PLKKATGKDSPCLSIAGPSSRSTDSSMEFS ISTECLDEPKKNGS 
TEEEKPSGHKLS FCDSPGC^MKNS VDSVKNSTVAI KSRPVSRVT 
NGTSNKKS IHEQDTWVNNSVLKXV'SGKGCSEPVPQAI I*KKRGTS 
NGOTAAQQRTKSTPSNLTKTQGSC^ESPNSVKSSVSSRQSDEjrV 
AKLDHNTTTEKQAPKRKMVKQVHTAL^ 

KGBTLNNKDSKQKM PPGQVI S KTQPS 6QRPLKHBTSTVQKSMFH 
DVRDNNNKDSVSF^KPHKPLINIJ^EISDAFJVbQSSCRP\DPQK 
PLNlX3BKEKIiALECQl«SKIJ)KSI>KHEI,ES^ 
HKETDDCDAANICCHSVGSDNVWS KFYSTTALKYMVSNPNENSI. 
NSNPVCDIJDSrSAGQIHLISDREWQVGRKDTWXQSS I KC V2DVS 
I^PEimfGTLNSAGKDKKSKVPVEG^^ 
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Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, B= 
Glutamic Acid, ^Phenylalanine, G=K3lycine, 
H=Histidine, I-Isoleucine, Kx=Lysine, 
L= Leucine, M=Methionone, N=Asparagine, 
P-Proline, Q=Glutamine, R=Arginine, 
S^Serine, T^Threonine, V^Valine. 
W=Tryptophan, Y=Tyrosine, X=Uhknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ATADSDVSSkCFSGQLSEKNSPKNMETSBSPESHBTPETPFVOH 

WNLSTX5VLHQRESPESDTGSATTSSDDIKPRSEDYDAGGSQDDD 

GSNDRGISKCGTMLCHDPLGRSSSDTSTPEELKIYDSNIiRIEVK 

MKKQSSNDLFQVNSTSDDEIPRKRPEIKSRSArVHSRKRENIPR 

GS VQFAQE IDQVSS SADBTEDERSEAENVAENFS I SNPAPQQFQ 

GI rNIiAFEHATBMECREFSAN KKFKRSVLIjSVDECBEIjGSDEGE 

VHTPFQASVDSFSPSDVPIX5ISHKHHGRTCYSRFSRESKDNILE 

CKONKGNSVCKNBSTVI^I^IDSSRKNKQ^VSATBKKOT 

SSRSRQLIJ^KKVNircsm^ENDIQC^KPlJ>SDVKSQERP^ 

DLHQaEPNSDIPKNSSTKSLDSFRSQVLPQEGPVKBSHSTTTEK 

AWIALSAGDIDDCOTL^^ 

KVES3THVTDMDF*DIX}HFAKQDWTIiLKQIjI^EQDSNIiDVTNSV 

PBDLSIAQYIilNQTI^IJVRnSSKPC«I^ 

SSAS ITMAS FSS EDCSPQGEWTILELETQH 


5819 


l 


5557 


AAAGLLCALHLVMTLVVAAARAEKEAFVQSRS 1 1 EfVLRPDDGGL 
LQTETTLGI^SVOQKSISLYRGNCR?IRPEPPMrJ)PlIEQPVaMP 
KMEKVYLIINPSSE*TITIiVSIFATTSHFHASFFQNRKILPGGNT 
S FDVS / VP1^VVGNVENTLFI1^SNIIGVFTY\QVF<TVGVPNPY 
FXRPFLGARVTVNSSFSPI INIHWPHSEPLQVVBMYSSGGDLHIi 
ELPTGQQGGTRKLI^IPPYETKGVMRASFSSREA1]NHTAFIR1K 
TNASDSTOFI ILPVEVBVTTAPGI YSSTEMJ^FGTLRTQDLPKV 
LNT.HLIiNSGTKDVPITSVRPTPQ\HDAITVHFKP ITLKAS \ESK 
YTKVAS IS PDAS KAKKPSQFSGKITVKAKE KS YSKtiEIPYQAEV 
LIX3 YLGFDHAATLFHIRDS PADPVERPI YIiTHTFSFTVIIjIHDVL 
LPEEAKTMFKVHNFSKPVLIIiPOTSGYIFTLLFMPSTSSMHIDN 
NILLITNASKFHLPVRVYTGFIiDYFVLPPKlEERFIDFGVLSAT 
EASNILFAlINSNPIELAIKSHHIIGDGMiSJELVAVDRGNRTT 

iisslpecekssssixjssvtlasgyfVavfrvkltakklvegih 
dgaiqittdyeiltipvk\avtavgsltcspkhwlppsfpgki 
vhqslnimnsfsqkvkiqqirsiisbdvrfyykrlrgnkedlepg 

KKSKIANIYFDPGLQC^DHC^VGXPFLSKSEPKVQPGVAMQEDM 
WDADWdLHQSLFKGWTGIKENSGHRLSAI fevntdlqkni iski 
TABI^WPSILSSPIUn^KFPLTNTNCSSVEBEITLENP/SQDVPV 
YVQFI PI^YSNPSVFVDKLVSRFNI^SK^AKJDI^TI^FQVFRN 
SAHPI^SSTGFMEG\l^PHLIIiNLILKPGEKKSVKVK\FTPVHN 
RTVSSLI I VRNNLTVMI^VlW0GX3GTTENUiVAGKIjPGPGS SLR 
FfQTEALLKIXrrDSLKIiREPNP^ 

ISGYSCEG YGFTCVVNCXJEFT1.SANASRD I IILFTPDFTASRVT R 
ELKFITTSGSEFWIIllASIiPY^ttATCAEAI^RPNWKEALYI I 
I SG IMS ALFLI*VIGTA\ YI*EAQGI HE P \ FRRRLS \ FEASNPPFD 
VGRPFDLRRTVGISSEGNLin^LSCDPGHSRGFCGAGGSSSRPSA 
G3HKQ*GPSGHPHSSHSNRNSADVDDVRAYNSGRTSSMTSAOAA 
SSQPANKTRPLVLD5NTGAQGHS AGRKS KGAKQSQHGSQHHAHS 
PLBQHPQPPIJPpVPQPQBPQPERliSPAPIJUIPSHPERASSARH 
SSEDSDITSLIBAMDKDFDHHDSPALHVFTEQPPSPLPKSKGKG 
KPIOTKVKPPKKQEEKEKlOGKGKPQEDELKBSIMDDSSSTrTE 
TSNPDTEPLLK£DTEKQKGK0^PEKKESEMSQVKQK3K3OiLNI 
KKEIPTDVKPSS^LPYTPPIiESKQRRI^liPSKIPLPTAMTSGSK . 
SRNAQKTKGTSKL VDNRPPALAKFLPKSQELGNTSSS EGBKDS P 
PPEHDSVPVHKPGSSTDSLYKI^LCmJ^rFXTORQTSPTPAS 
PSPPAAPCPFVARGS YSS IVHS SSSSDPKI KQPNGS KH KLTKAA 

HAPVDS DGSDS SGLHS PVSNPSS PDFTPLNS FSAFGNSFNLTGE 
VPSKLGLSRSCNQASQRSWNE FNSGPS YLWES PATDPS PSWPAS 
SGSPTHTAT£VLGNTSGLWSTTPFSSSIWSSNLSfiAI.PFTTPAN 
TLASIGLMGTENSPAPHAPSTSSPADDIiGCrrYNPWRIWSPTIGR 
RSSDPWSNSHFPHEN 


5820 


310 


1270 


RVSL^PVSLGVLLCARSSTMGKRDDmVAYMNPlAMARSRGPIQ 
SSGPTIQ\VI * IDQGLPGKK* KSN * KRKRK/DS KALAB FEEKMN 
EIWKKF1>EKHREKLLSGSESSSKKRQRKKKEICKKSW* \DSSSS\ 
SSSSDSSSSSSD3EDFJ)KKQGKRRKKKKNR$HXSSESSMSETES 
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Amino acid segment contaming signal peptide 
(A=Alanine, C=Cysteine, D-Aspartic Acid, B- 
Glutamic Acid, ^Phenylalanine, G^Glycine, 
KrrHistidine, l=Isoleucine, K^Lysine, 
L-Leucine, M=*Methionine , N=Asparagine, 
Pt= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Tbreonine, VaValine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








DSKDSLKKKKKSKDGTEKBKDIKGLSKKRKMYSEDKPLSSESLS 
ESEYIBBVI^KKKSSEBREKATEKTKiaCiaanCKHSKKKKKKAA 
SSSPDSP*H*EKSGFPYKBSAMSEE ISTVKTTTYIjLKCMNFIiVF 
GIIPGLFSSHSDATV 


5B21 


179 


915 


kwrxwswrwpkpgtnwmi»srevotpjrvtwt^ 

pt/ikdcsiaatgkrpsarfphqrrkkrrkmddglaeggpqrsn 

tyviklfdrsvdi^fsbotplypiciy^wmrnspsvrerecsps 

splpplpedeeg\sevtnsksr*cvqacppthtpggqpknacr\ 

sripsplaaijlmcx5tp*rwspfbpepspstliyrnmqrwkrrrq 

rwkeashrnqlrysbsmkilremyerq 


5822 


464 


4379 


QflXKEMPIVMARDLBErASSSEDEEVISQEDHPCIMWTGGCRRI 
PVLVFHADA I tiTKDNNI RVIG BR YHLS YKI VRTDSRLVRS ILTA 
HGFHEVHPSSTDYNLMWTGSHLKP FIJLRTLSEAQKVNHFPRS YE 
LTRKDRLYKNI I RMQHTHGFKAFH ILPQTFLLPAEYAB FCNS YS 
KDRGPWIVKPVASSRGRG\VYLIIWPNQISLEEHILVSRYINNP 
IJjIDDFKPT3VRLYVIiVTSYDPIjVIYIiYEEGIaARFATVRYDQGAK 
NIP^QFMHIiTNYSVNKKSGDYVSCTDP3VFJ)YGNKW^AMLKY 
LKQEGRDTTALMAHVEDLIIKTIISAB^^ 
FEIiYGFDVLIDSTLKPKLLEVNIiSPSIAmAPU>LKIKASMISD 
MFTWGFVOQDPAQRASTRP I YPTFESSRRNPFQKPQRCRPLSA 
SDABMKNLVGSARE KGPGKIjGGS VLGLSMEBI KVLRRVKBEKDR 
RGGFIRI FPTSF^BXYGS YI^HKTSMNYMLATRLFQDRMTADG 
APELKI*Sl^SKAKI J HAAI J YERKLLl^LEVKKRRRRSSRIjRAMI^ 
KYPVITQPAEMNVKTETBSEEEEEVALDNEDEEQEASQEBSAGF 
LRENQAKYTP8LTALVENTP KENSMKVREWNNKGGHCCKLETQE 
LBPKFNLMQI LQDNGNLS KMQAR IAFSAYLQHVQ I \RI*J KDSGG 
OTFS AS WAAKKDEQMBLWRFLKRASNNLQHS LRMVL PSRRLAL 
LKRTRILAHQLGDFI 1VYNKSTEQMABKK5KKKVEBBEEDGVNM 
ENFaEFIRQASRAFXEEVT»TFYTQKNKSASVFIX3TH5iaSKNNN 

WVQn<5/lA Jf dr\VDVPXUtf?V\ftrx in)DVrtrtnTan?nii?nvT crtr^mmrt » 

m * oua*Lmt\j\\aLjn*rss i xMtsjs v ivi af j/nyyy l x BlxioJWKliSRFTTSA 

EKEAKLVYSNSSSGPTATLQKIPfJTHLSSVTTSDLSPGPCHHSS 

LSQIPSAXPSMPHQPTIMiNTVSASASPCIiHPGAQRIPSPTGLP 

RCRSGSHTIGPFSSFQ3AAHI YSQKOSRPSSAKAGS CYLiJKHHS 

GIAKTQKE^EmSLYSKRYNQSMVTASIiQRIAEKQAARQYSPSS 

HrNLLTQQVTNLNIATGI INRSSASAPPTIiRPI ISPSGPTWSTQ 

SDPQAPE3raSS5PGSRSLQTGGFAWE^EVENNVYSQATCVVPQH 

KYHPTAGS YQLQFALQQLFQQKLG^RQLIiIXJSRARH FGSQT 

LPNSNLWTMNNGAGCRISSATAiXK2^^ 

VPKPPPNHEQVLRRATSQKASKGSSABGQLNGI^^ 

ITSSTDPAHTKTMNHKHTEKQPVHH3WVHD 


5823 


42 


2293 


LLTAI>SWEGGGGRI)EPSAC31AGDVNMDDPKKBDI LLLADEKFDF 
DI^I^SSSANEIJDBVFFGPFGHXERC^LAASLF^liNPVPEQPPLP 
TSESP PAWS PLAGE KFVEVYKEAIILLALH I ES SS RNOAAQAAKP 
EDPRSQGVER.FIQESKP\KINI»FEKEKEMKKS PTSLKRBTYYLS 
DS PLLGPPVGEpRLLAS SPALPSSGAQARLTRAPGPPHSAHALP 
RES CTAMAASQAATQRKPGTKLLL PRAAS VRGRG I PGAAEKPKK 
EIPASPSRTKIPABKESHRDVLPDKPAPGAVNVPAAGSHLGQGK 
RAIPVpXNKLGLKKTLLKAPGSYSNVIjQRKSSSGAXVWSGASSA 
CTPQPVAKAJCSSEFASIPAN*LPGI^PNISKS\GRMGPAMIJRPA 
I*\PAGPVG\ASSWOAKRVDVSKIiAAFX^TAPP\SASPTQPOTPE 
GGG\aWIiNSSCAWSESSOLJTKTRSlRRJiDSCI^SKTKVMPTPTN 
QFKI PKFS IGDS \PDSSTPKLSRAQRPQS CTSVGRVTVHSTPVR 
RSSGPAPQSIiLSAWRVSAIJ>TPASRRCSGIiPPMTPKTMPRAVGS 
PL\CVPARRRSSEPRKNSAMRTEPTRESNRKTDSR\LVDVSPDR 
GSPP3RVPO^UiNFSPEESC«TFSKSTATF^U«FAKPGGnAAPS 
EALLVDIKLEPIAVTPDAASQPI>I DLPt, I DFCDTPEAHVAVGSE 
SRPLIDLMTNTPDMNKNVAKPSPVVGQLIDLSS PLIQISPEADK 
ENVDSPLUCF 


5824 


42 


2293 


LLTAI^MECX^PJ^EPSACRAGDVNMDDPKKEDIUjLADEKFDF 
DLSLfiSS£ANET)DBWFGPFGHKERCIAASLELNNPVPEQPPLP 
TflR^PFAWSPIiAGEKFVEVYlCEAHLLAlJa 



380 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 

NO; 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, (^Cysteine, D»Aspartic Acid, E*> 
Glutamic Acid, pr= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=I#ysine f 
L=Leucine, M=Methionine, N-Asparagine , 
P=Proline, Q=Glutaraine, R^Arginine, 
S-Serine, T=Threonine, V=Valine, 
W-Tryptophon, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








EDPRSQGVKRFIQESKFXKINLFEKBKEMKKSPTSLKRETYYLS^ 
DSPLLGPPVGEPRLLASSPAt»PSSGAQARLTRAPGPPHSAHALP 
RKSCTAHAASQAATQRKPGTKLIjLPRAASVRGRGI pgaaekpkk 
EIPASPSRTOPAEKBSHRDVLPDKPAPGAVNVPAAGSffliGQGK 

RAI PVP \nklglkktli*kapgs ysn\lqrxsssga\vwsgassa 

CTPQFVAKAKSSEFAS I PAN * LPGDCPN 1 5 K5 \GRMGPAMLRPA 

L\PAGPVG\ASSWQAKRVDVSKLAAEQLTAPP\SASPTQPQTPE 

GGGVQWX^SSCAWSESSQI^KTRSIRP^ 

QFKIPKFSIGDS\PDSSTPKI»SRAQRPQSCTSVGRVTVHSTPVR 

RSSGPAPQSLl^AWRVSAIiI>TPA3RRCSGI*PPMTPKTMPRAVGS 

PL\CVPARRRSSEPRKNSAMRTEPTRBSNRKTDSR\ LVDVSPDR 

GSPPSHVPG^VI^FSPEESDQTFSKSTATEYAREEAKPGGDAAPS 

BAIiIiVDIKriEPIAVTPBAASQPLlDLPLIDFCDTPBAHVAVGSE 

SRPLIDLMTNTPDMNKNVAKPSPVVG<)LIDLS 

ENVDSPLIiKF 


5625 


2 


4210 


FliQIESASPAPFSSGFLAAHPHSPGGSIATKGRSRI^SAPGMI^ 
SAAPPAP PPEVTATARPCLCSVGRRGEGGKMAAAGAI>ERSFVEIi 
SGAERER PRHFRBFTVCS IGTANAVAGAVKYSESAGGFYYVESG 
KI>FSVTRNRFIHWKTSGDTIiEJ^EESI^ INLLNNAIRIiKFG^ 
VLPGGVYVSETQNRVI I IiMKJ^QTVHRIiliLPHPSRMYRSEr.VVD 
S QMQS I FTD I GKVD FTD PCNYQL I P AV PG I S PNS TASTAWLS S D 
GFJu^FALPCASGGIFVLKTiPPYDIPGMVSVVBLKjQSSVMQRLLT 
GWMPTAIJffiDO^PSDRPI^LAVHCVEHDAFIFA^ 
YKEQMObMVADHLEYVPVKK^^ 

GIF\MHAPKRGQFClFQLVSTESNRYSIiDHISSIiFTSQETLIDF 
ALTSTDIWALWHDAENOWVK^INFEHNVAGQWNPVFMQPLPEK 
EIVTRDDQDPREMYLG^l.FTPGQFTWEALCXAIiQIFCRGTERNL 
DI^WSEIiKKEVTLAVEirEIXX?SVTEYEFSQEEFRi^QQEFWCKF 
YACCLQYQEAIjJS HPLALHLN PHTNMVCXiIiKJCG YTjS FT* I PfiSLVD 
HLYLLPYEKLIiTEDETTI SDDVDIAKDVICLI KCLRLI EES VTV 
DMSVIMEMSCYNI^PBKAAJBQII^OTTIDVEl^EDICSKLQ 
EIRKP IHAIGl*L IRQ4D YETEVEME KG FNPAQ PLNIRMNLTQLY 
GSNTAGYI VCRGVHKXASTRFLICRDLIiILQQLLMRLGDAVI W 
TGQLFQAO£DLLKJITAPIjLIjS YYL IKM1SE 

C^HLSrVLELTDSGAlJIANR FVSSPO^ IVELFFQEVARKH I ISHL 
FSQPKAPLSO^I^PEMITAITSYLLQr^WPS 
GNCQYVQLQDYIQIiLroWCQVNVGSCRFM^ 
BCFCXJAASEWOEFIjDRLIRSEDGErVSTPPXQYYDKVIJlLriD 
VIGLPEL VIQLATSAI TEAS DDW\ KS QATL \ RTCI FKHHL\ DLG 
\HNSQAYGSL * PQI PD^SRQLIXXRQLVVVLCKREQLQDLVEFS 
YVNLB^IE VVGI lESRARAVDLMIHNY^ 
TVMPBYGMRIX3RBVRTIJlGLBlu2GKCYIiAA^ 
VQPVSGAVYDRPGAS PKRNHDGECIAAPTNRQI E I LELBDLEKE 
CSIJu^lRlilljAQHDPSAVAVAGSSSAEEMVT^ 
IiOQTFKLPLTPVFEGLAFKC I KIiQF^X^AAQABAWAWr AANQLS 
SVITTKESSATDEAHRO^TYIiBEYKVQNNLYlfflCV 
VPI.PNWLINSYKKVDAAELIJiLYLNYDLLDLTPYQVIRlCJQC 


5826 


3 


871 


KSQLLRDHSAPPPKPCTSVGAMGC* PRQ/ SPKEQQRQLKKQKNR 

AAAOilSRQKHTDKADALHO^HESLRKDNLAJb^ 

WSRTI^VHERiCPMIX^CSAPGIiI^CWDQAEGLIiGP^ 

CREQLELFQTPGSCYPAQPLSPGPQPHDSPSLLQCPLPSLSLGP 

AVVAEPPVQl^SPSPLLFASHTGSSLQGSSSKLSALQPSLTAQTA 

PPQPLBLBHPIllGKIXSSSPDNPS^AIiGIJUUjOSIffiHKPAr^AAT 

WQGLWD P S PHP LLAF PLLS SAQ VH F 


5827 


194 


2287 


GMGSENSALKSYTLRBPPFTLPSGLAVYPAVLQDGKFASVFVYK 
R^NEDKVNKAAKVP**HWCTLRH2^^ 

RVQ PliEVAJUETLS SAEVCAG I YD ILLAIjI FLHDRGHIjTHNN VCL 
SSVFVSBDGHWKXGGMETVCKVSQATPBFLRSIQSIRDPASIPP 
E EMS PE FTTLPECHGHARDAFS FGTLVES LLTILNEQVS ADVIiS 
SFQO/TI^STLIJWPI PKHRPALCTLIjSH 

TLKSEEEKTE FFKFLLDRVS CLS EELIAS RLVPLLLNQLVFAEP 
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Amino acid segment containing signal peptide 
(A- Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K»Lysine # 
L=Leucine, M-Methionine, N=Asparagine, 
P= Proline, Q^Glutamine, R=Argxnine, 
SoSerine, T=Threonine, V-Valine, 
W-Tryptophan, Y-Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VAV\KSFIJPYI^PKKDHAQGETPCIjLSPALFDSRVIP\nXQI,F 
E^HEEH\mMVLLSHIi^YVGAijSI*REQLKKV\IL\PQVLLn\LR 
D\ TS DS I VAITLHSLAVLVSLLGPEWVGGERTKI FKRTAP \ S F 
TK\NTDI»SLEGDPFSQPIKFPINGLSDVKNTSKDSBNFP5SSKK 
SBBWPDWSGPB\EPENQTWI\QIWP\RKP\C^EVKSC^TTIJ)V 
BBSSWDIX:BPSSIiDTKVNPG<K3ITATKPVTSGEQKPIPALLSLT 
BESMPWKSSLPQKISIjVQRGDDADQIKPPKVSSQBRPLKVPSEIi 
GLGKEFTIQVKKKPVKDPEMDWFADMI PB I KPSAAFDTLPELRT 
BMVPKKDDVSPVMQFSSKFAAAKITEGKAEGHEEEGEIJWEDMN 

w 


5828 


2 


■ 257 


AREGGSLGAVAACEELSYSCDFCPARPHTSWjTRFVKMEFQAW 
MAVGGGSRMTDLTSS I PKPLLPVGNKFL IWY?LNLLERVGFEEV 
IVVTTRDVQKALCABFKMKMKPDIVCIPDDADI^TADSLRYXYP 
KLKTD VLVLS CDLI TD VALHEWDL FRAYDAS LAMIjHRKGQDS I 
EPVPGQKGKKKAVEQFJ>FIGVDSTGKRIiLFMANEADLDEEI>VIK 
GS ILQKHPRIRF}rTGI*VDAHLYCLKKYIVDFLMENG\SITSlRS 
HI»\ I PYLV/RGKQPS SASSQQGTRKEKEGGSKGKRGLKSFRIS Y 
SFY*KEANYTGTGAPY\D\ACWI 


5829 


260 


1259 


PDGRIjI VSCSEDId IKIWDTTNKQCVNNFSDSVGFANFVDFNPS 
GTCIASAGSDQTVKVWDVRVNKI^LQHYQVHSGGVNCISFHPSGN 
YL I TASSDGTLKI LDIiLKGRL I YTLQGHTGPVFTVS FSKGGELF 
ASGGADTQVLLWRTNFDELHCTOSD^ 

PRTPHPHEBTCVBTVEDFFLHIJ^IQSIiR*SIC!RSLLPL^ISF 
LLILPQQQKPVVGLCX}^VKRPVDIS*TLP*CHQNVCQQPRKRK 
QKT*VTSPVKVK/VSIPIAVTimi^IM 
LTLTED FCLKD CLENQQ KLFSAVQQKS 


5830 


4496 


3139 


GGKMAAPEBRDLTOEOTEKLIiOFODLTai ESMDOC5?HTTiVf)HNW 
N I EAAVQDRLNE QEGV PS VFNP PPS RPLQ VNTADHR I YS YWSR 
PQPRGLLGWG YYL IMLP FRFTYYT I LD I FRFALR FIRPDPRS KV 
TDPVGD I VS FMHS FEE KYGRAH P\TFYGGTYSQAliNDAKREIiRFL 
LVYLHGDDHQDSDEFCRNTLCAPEVISLINTRMLFWACSTKrKPE 
GYRVSQALRENTYPPTiAMIMLKDRRE * PV\ VGRLEGLI \QPDDL 
INQLTFIMDANQTYLVSERJjERBERNQTQVLROXXJDEAYLASLR 
ADQEKERKKRBBRERKRRKKEEfVO^XJKLABERRRQNLQEEKERK 

LECLPPEPSPDDPESVKI ifklpndsrverrfhfsqsltvihdf 

I*PSLKESP\EKFQIEA\NFPRR\VI>PCIPSBE\WPNPPT1jQE\A 
GLSHTEVLFVQDLTDE 


5831 


71 


2897 


FCSKDKCCLYLPDSINRSXSCrAKPGAHSQDRHAVMDSBRQVKD 

TDD IESPKRS I RDSGYIDCWDSERSDS LSP PRHGRDDS FDSLDS 

FGSP^RQTPSPDVVLRGSSDGRGSDSESDLPHRKLPDVKKDDMS 

ARRTSHGBPKSAVPFNQYX.PNKSNQTAYVPAPLRKKKAEREEYR 

KSWSTATSPAGIiGKKALQDYGPRT\PVS\DDABSTSMFDMRC3E 

EAAVQPHSRARQEQLQL INNQLRBBDDKWQDDLARHKSRKRSVS 

QDDI KKEEKRKKME KliLAGEDGTSERRKSIKTYRB IVQEKERRE 

RJBI^EAYKNARSQEEAF^ILQQYIFJtFTISFJVVLERIiFWPKXLB 

RSHSTE PNJbSS FLNDPNPMK YLRQQSLPPPKFTArVETT I ARAS 

VU7TSMS AGSGSPSKTVTPKAVPMLTPKPYSQPKNSQDVLKTFK 

VDGKVSVI^TVHREEEKERECPTVAPAHSLTKSQHFEGVARVH 

GSPLELKQDNGS IE INI KKPNS VPQBIiAATTEKTE PNSQBDKND 

GGK5RKGNIELASSEPQHFTTTWRCSPTVAFVEFPSSPQLKND 

VSEEKDQKKPEMEMSGKVELVLSQKVVKPKSpBPKATIiTFPFLD 

KMPEANQIJILPNLNSQVDSPSSEKSPVTTPFKFWAWDPEBERRR 

QEKWQQEQBRIiIX3ERYQ\KEQDK\LKEE\WEKAQKEVEEEERRY 

YKBEP+ 1 1 \ EDPWPFTVSSSSADQLSTSSSMTEGSGTMNKIDL 

GNCQDKKQDRRWKK5FQGDDSDLLTJCTRESDRLEBKGSLTRGAL 

AHSGNPVSKGVHEDHQLDTEAGAPHCXTIWPQ^ 

PTH5S EDVXPKTLPLDKS INHQI ES PS ERRKS 1 $GKKI*CSS CG Jj 

PLGKGAAMI IETLNLYFHIQ CFRCG\ I CKGQLGDAVSGTDVRIR 

NGIJjNOIDCYMRSRSAGQFTTL 


5832 


2454 


829 


PGRRPRHGSCAFQKQCIMIjHI cq yflqgeckfgtsckrshofsn 
SEOTiEM^KLGMSSDLVSRLPTI YRNAHD IKNKSSAPSRVPPLF 
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Amino acid segment containing signal peptide 
(A=Alanine, C»Cy3teine, D=Aspartic Acid, E- 
Glutamic Acid, F*= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L=> Leucine f M=Methi on ine , N^Aeparagine , 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine , V= Valine, 
W-Tryptophan, Y^Tyrooine, X -Unknown, *=Stop 
Codon, /=po33ible nucleotide deletion, 
\-possible nucleotide insertion) 








VPQGTSBRKDS SGS VSPNTLSQEEGDQICLYHIRKS GS FQDKCH 
RVHFHIiPyRWQ^IJJRGKWEDLDNMELIHEAYCNPKIKRILCSES 
ASTPHSHCLNFNAMTYCMQA^ 

WSDEPGSWQEYGRQGTVHPVTTVSSSDVEKAYLAY/WYTGV*R 
PGSHLEVPGRKAQLR VR EQSLRCE KPGLWHN* KGLPQTQ IR\AP 
QDVTTMQTCNTKFPGPKS I PD YWDSSALPDPGPQKITLSS3SEB 
YQKVWNIiFNRTIjP FYFVQKI ERVQNIiALWEVYQWQKoO^IQKONG 
GKAVDERQLFHGTSAI PVDAI COX)KFDHRVCGVHGTS YG KGS Y F 
ARDAAYSHHYS 2CS DTQTHTM FLARVLVGKFVRGNAS FVRPP AKE 
GWSNAFYDSCVNSVSDPSIPVIFEKHQVYPBYVIQYTTSSKPS^ 
TPSILLALGSLFSSRQ 


5833 


170 


3289 


SIIiCl^PCVVQFaKTWSILSSRSRHSPCTKKGWF^MRKHI^HT 
RQGHK* VHVE I S KALWVYRDDYF I RHS ISVS AVT VRAWI THKYR 

GYIWNLRANRIPO^PLENDVVALLGFPYASSGBNTGIVKKFPRF 
RNRE1*KATRRQRMDYPVFTVSLWLYLIjHYCKAMLCGII*YFVDSN 
KMYGTPSVFLTESGYI^IQMHLVKGEDIAVKTKFIIPLKEWFRL 
DISFWGGQIVVTTSIGQDLKSYHNQTISFREDFHYNDTAGYFII 
GGSRYVAGIEGFFGPI»KYYRJ^SLHPAQIFNPIJiEKOJLAEQI^ 
YYBRCABVQE I VS VYASAAKHGGERQEACHLHNS YLDLQRRYGR 
PSMCRAFPRK KELiKDKH PS kPQALLBMDLLTVPRNQNBSVS E I G 
GKIFBKAVKRLSS IDGLHQISS IVPFLTDSSCCGYHKASYYLAV 
FyBTGLNVPRDQLQGMLYSLVGGOGSBRLSSM^^ DN 
YPIJ>WEIjSYAYYSNIATKTPI^HTLaGIX)AYVm , IHr,KDDEII» 
KVO/ITCEDGDVFWnUKREATR^ 

EAAIEWYAKGAI^TEDPALlYDYAIVLFKGQGVKKNRRIJaELM 

KKAAS KGI£OAVNGI^ WYYHKFKKNYA\K7VAKYWLKA\EE \MGN 

PDASYOTiGVWajJGIFPGVPGRNQ^ 

WCSLYYITGNLETFPRDPEKAVVWAiOIVAEK^ 

AYIiEGSWHEAI»LYYVIiAAETGIEVSQTNIAH^ 

GVNCVWRYYN FS V FQ IDAPS FAYLKMGDI*YYYGHQNQ5QDliEI*S 

VQr^AQAALIXJDSO^FFIJIiALLlEEGTIIPHHIIiDFI^IDSTI^ 

SNNISILQKLY^RCWSHSNEBSFSPCSLAWLYLHLRIjLWGAILH 

SALXYFICTFLI^ILIAimrQYFOSVSASDPPPRP^ASPDTAT 

STASPAVTPAADASDQDQPTVTNNPEPRG 


5834 


17 


4020 


RFRRGGGRVFPGAFPASPSDSLGQGNSQGPPRTPKPPRT/QBCG 

3AAPGP 1 PGQS3S * VPlaOiEQIO^KAIX^I^I^I^KPRMAAQV 

TTiEDALSNVDLT.KELPLPDOQPCIEPPPSSLLYQPNFNTNFEDR 

NAFVTG IARY I EQATVHS SMNEMI«KBGQBYAVMLYTWRSCSRAI 

PQVKCMEQPlTOVEIYEKTVBVI^PEVTKI^FMYFQRmiBRFC 

GliVIiRLC^RPRTOFVSFAYT.ITLGKFINMFAVIJDELKNMKCSV 

KNDHSAYKRAAQFLRKMADPQS IQESQNLSMFLANHNKITQSLQ 

OJI^ISGYEBIJJ^I\«LCVDYYBimMYLTPSElQ^^ 

GLYIMDGSVSNIYlOinAjCKRINI^KIDKYFKQLQVV^ 

FXARYIKTSAHYEENKSRWTCTSSGSSPQYNICEQMIQIRBDI1M 

SQWSAHVMEVYSWKLVHPTDKYSNKDCPDSAEEYERATRYNYTS 
EBKFALVEVXAM I KGLQVl^GRMESVFNHAlRHTVYAAIjQDPSQ 

RGBKDPKSG*DI KVPRRAVGPSSTQIiYMVRTOLBSLIADKSGSK 

KTLRSSLEGPTILD I EKPHRES FFYTHL INFSETLQQCGDIiSQL 

WFREFFLELTMGRRIQFPIEMSMPWILTDHILETKEASMMEYVIi 

YSIiDIjYNDSAH YALTRFNKQFLYDE 1FJUSVNLCFIX2FVYKLADQ 

IFAYYKVMAGSLIxI^KRLRSECKNQGATrffliPPSNRYETLIiKQR 

HVQLLGRS IDLNRLITGjRVSAAMXTCSLBIiAlGRFBS BDLTS I VE 

I£)GI^EINRMTHKIJ^RYLTLI^ 

HVPWBliNYl)FIiPNYCYircSTNRFVRTVl#PFSQE 

QYLHGS KALNLAYS S I YGS YRNFVG PPHFQ VTCRLLGYQGIAVV 

MBBLLKVVTCSIJiO^TI LQYVKTtiMBVMPK^ 

EFFHHQLKDIVEYAELKTVCFQNuPvEVGfl^ 

EVCDIJxHAAPFQNILPRVHVKEGBR^AK^ 
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Amino acid segment containing signal peptide 
(A^Alanine, OCysteine, D=Aspartic Acid, E- * 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=His t idine , I^Isoleucine, fk=Lysine, 
L^Leucine, M^Methionine, N~Asparagine, 
P=Proline, Q=Glut amine, R=*Arginine, 
S=»Serine , T=Threonine , V«Valine , 
W^Trypt ophan , Y=Tyxosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\ -possible nucleotide insertion) 








IiIERLGTPUQIAIAREGDIXTKERLCCGIiSMPEVILTRIRSFIi) 
DP IWRGPLPSNGVMHVDE CVEFHRLWSAMQFVYC I PVGTHE FTV 
EQCFGDGLHWAGCMI 1VLI/3QQRRFAVIjDFCYHLLKVO;KHDGKD 
EIIKNVPUO<MVERIRKE^IIJroEIITILDKVIJ^GI>GEGTPVE 
HVRCFQPPIHQSIjASS 


5835 


4209 


1904 


SGWIRMAQGSHQIDFQVLHDIjRQKFPEVPEVWSR(^I^ 

imCTAVLSQBSTRYT^YGBGDl^FSDDSGISGLRJNHMTSIAXDI^ 

SQNIYHHGREGSRMtJGSRTLTHSlSDGOLQG<K^NSKLFQQEPQ 

TAPAQVPQGFNVFGMSSSSGASNSAPHLGFHLGSKGTSSLSQQT 

PRFNPIMVTLAPNIG^ROTPTSLiailGVPPPVLIISPQGNSIYI 

RPYITTPGGTTRQTQQHSGWVSQFNPMNPX»VYQPSQPGPVmx: 

PASNPI>SHTSSQQPNQQGHQT3HVYMPrSSPTrSQPPTIHSSGS 

TSSTSSSVNSQTI^RWQFI^YIAASPPNTDKLMSRSQPKVYISA 
NAATGDEQ VMRNQPTLF I S TNSGASAAS RNMSGQ VSMGPAF I HH 
HPPKSRAIGNNSATSPKVVVTQPNT\KYTFXITVSPNKPPAVSP 
G WS PTFELTBrtJiNHPDHYVETKNIHHLTDPTLAHVDRIS E 
LSMGSDDAAYTQDI *RISNSHI/3MVAHACNSSAI»GGQDGR 1 1 +A 
QEFETSWGNT WRLRIiYRRF *NYAGMVJVHTCSP3Y3VD * ALLVHQ 
KARMEJUjQREtiBI QKKKLDKLKSEVNFJ4ENNLT IS 
QI PS IjKEMQQLRS CNRQLQ I D I DCLTKE IDLFQARG PHFNPS AI 
HNFYDNIGFWPWPKPKIXJRSIIKTPKTQDTKDDBGAQWNC1A 
CTFLNHPALIRCBQCEMPRHF 


5B3S 


361 


2303 


raiTMCGICCS VNFSAfelHFSQDt»KHDLI»YNI*KQRGPNSS KQLLK 
SDVNYQCIiFSAHVLHXiRGVLTTQPVEDERGNVFLWMGKl FSGIK 
VSAEKNDTQILFNYIiSSCKNESEILSLFaEVQGPWSFrYYQASS 
HYXWFGRDFFGRRSIiLWK^SNLGKSFCLSSVGWrSGr^NQWQE 

VKFQQTYQHLYX)R*QMKPNC I LKNLLFL* I * COHCIiHWRLI AVI 
FPMCHLQER YFKS FLIjMYT * KEVTQQFI DVLSVAVKKRVLCL PR 
DHNLTANE VLKTCDRKANVAI LFSGG I DSHVIATLADRHI PLDE 
PIDLLNVAPIAKBKTMPTTFNREGNKQKNKCEIPSEEFSKDVAA 
AAADSPNKHVSVPDRITGRAGLKBIiQAVSPSRIWNFVEINVSME 
BI^KLRRTRlC^IRPUyrVLDDSIGCAVWFASRG IGWLVAQEG 
VKS YQSNAKWLTG I GADEQIiAGYSRHRVRFQSHGLEGIiNKE I M 
MBIiGRl S S RNLGRDDRVIGDHGKEARFPFLDENVVS FLNSLP I W 
BKANLTLPRGIGBKLLL^^ 
KMEKINEKASDKCGRLQIMSIiENLSIBKETRIi 


5B37 


4792 


903 


NGN AVAQAP VTNCCYI>ATGSKBCTIR I WS CS RGRGVM I LKLPFL 
KRRGGG I DP TVKERLWLTI>HWPSNQPTQI>V8 8 CFGGELLQWDLT 
QSWRRKYTLFSASSEGQNHSRIVFNLCPLOTFJDDKQIjIjI^TS^ 
RDVOCCWDIATI^CSWTIiPSIiGGFAYSLAFSSVDIGSLAIGVGDG 
MIRVWl!rn»S IKNNYDVKNFWQGVKS^ 
DDGKVGljYDTYSinCPPQISSTYHKKTVYTI^ 
GDRPSLAIiYSCX^EGIVLQIINPWKIjSGEAFDINIKIjIRDTNSIKY 
KLPVHTET SWKADGKIMALGNEDGSIEI FQ\ IPNLKLICTIQQH 
HKLVNTIS WHHK\HGSPAQKLS Y1»\MPSGSQQCS PFTCHNLKKC 
P * KAAPE S PSDPLQS P YRTP PQGHTAQD YPVHAWBPH IH* WBGL 
WCFPID^YSPGCWD\AFPGK£AI^IFRG\HOGRLLCVAWSPL 
DPDCXYSG VADDFCVHKWLTSMQDHSRP PQGKKS IBLEKKRLS Q 
PKAKP KKKKKPTLRT P V KI.BS I DGNKE BSMKKMSG P VEWG VSDQ 
EGEEQAREPELPCGIAPAVSREPVICTPVSSGFEKSKVTINNKV 
ILLKKEPP KEKPETL I KKRKARSLbPLSTSLDHRSKEELHQDCL 
VTiATAKHS RE LNEDVSADVEERFHIi3IjFTDRATLYRM ID I EGKG 
HLENGIIPELPHOJjMLWKGDLKGVLQTAAEP^BLTDI^VAMAPAA 
GYHVWI^VEAFAKOLCFG^QYVKAASHriLS tHKVYEAVEIiLKS 
NHFYREAIAIAKARLRPSDPVLKDLYIiSWGTVL^ 
CTCiG ATCAYDAAKVIiAK3GGJlAASI^T^ 
LRCAQELIiliANNWVGAQKALQI^ESLGXKJRL 
FJCQLSFX3KSS$SYHltTOTGTEGPFVERVTAVWKSIFSIJ>TPEQY 
QRAFQKLQNI KYP^TlIbrrPAKQl»tJjHl C^LTLAVLSQQMASW 
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Amino acid segment containing signal peptide 
(A^Alanine, OCysteine, D=Aspartic Acid, E- 
Glutamic Acid, F= Phenyl alanine , G^Glycine, 
H«=Histidine, I~Isoleueine, K=Lysine, 
L-Leucine, M«Methionine, N=Asparagine, 
paproline, Q=»Glutamine , R=*Arginine, 
S=Serine, T=Threonine , V^Valine, 
W-Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DFAVQALLRAVVRSYDSGSFTI MQBVYSAFLPDGCDHLRDKLGD 

HGSPATPAFKSLKAFFLYGRLYEFWWSLSRPCPNSSWVRAGHR 

TLSVEPS<X)IiDTASTKErDPETSQPEPNRPSELDIiRLTEEGERM 

LSTPKELFSE KHASIiQNSQRTVABVQBTLAEH IRQHQKSQIiCKS 

TANGPDKNBPEVEAEQPIjCSSQSOXIKEEKNEPLSLPELTKRIjTK 

ANQRMAKFPESIKAWPFPDVI^CCLVLIJjIR 

QAQEI.LQKYG11TKTYRRHC0TPCM 


583B 


110 


98 


KTMPHLLVTFRDVAIDFSQEEWBCH>PAQPJ)LYRDVMLENYSNIi 
ISLDLESSCVTKKLSPEKEIYEMES\PSGRIWGNVSTITPQYNG 
I^DNMECKGNLEGOVSKSEGLYMCVKITCBEKATESHSTSSTFH 
RII /HYQGKI VKCKBCRQGFSYXSCLIQHKEHHNI*KCSEVNKH 
RNTFSKKPSYI*HQ\KFRLGEKPYECMECGKAFGRTSDLIQHQK 
XHTNEKPYQCNACGKAPIRGSQLTKHQRVHTGEKPYDCKZCGKA 
FSYCSQYTLHQRIHSGEKPYECBCDCX5KAPILGSQLTYHQRIHSG 
EKPYECKE03KAPILGSHLTYHQRVHTGEKI>YICKECGKAEXCA 
SQrJ^HQRIHTGEKPYECKEOGKTFFRGSQLTYHLRVHSGERPY 
KCKECGKAFISNSNLIQHQRIHTGEKPYKCKECGKAFICXJKQLS 
EHQRIHTGKKPFECKECX5KAFIRVAYLTQHEKIHGEKHYECKEC 
GKTFVRATQLTYHQK I HTGEKPYKCKECDKAF/HLWLT ILSEHQ 
RIHRGKKPYBCKQCGR/l^IRG£HL/NEm.RTHTGEKPYECKEC 
GRAFSRGSEHTLHQR I H1T3KKP YTCVQCGKDFRCPSQLTQHTRL 
HN*RYSSHKICMHSIALASLDFAHLQEKNPBN 


5839 


1 


2425 


" GRPFPRPPRAI»PRLPLPJGRRQDGRWTVDFEBCLKD\SPRFRAAL 
EEVEGDVAEIiELKLXDKLVKLC IA\MIDTG KAFCVANKQPMNG I 
RD\ LAQNS \NNDA\ WET KFAPS FliDSLQEM IUFHTIL/ L* PUS 
EIN* GHS FQNFVKEDtJRKFKDAKKQFENSQ* KRKKIALVKNAPV 
PSRPASLEL*K^PNILTATRKCFRHIAIJ)YVLQINVLOSKRRSE 
ILKSMLSFT^AHIAFFHQGYDLFSBLGPYMKDLGAQLDRLVGZJA 
AKEKREMEQKHSTI QQKDFSRDDS KLKYNVDAANGIVMEGYLFK 
RASWAFKTWNRRWFS IQJ3HQVVYQKKFKDN PTVWE DLRLCTVK 
HC^IERRFCFEVVSPTKSCMUiADSEKLRQAWrKAVQTSI \AT 
AYRBKDDESEKLDKKSS PSTGSLDSGNESKEKLLKGESALQRVQ 
CTPGNASCXTDCXSLADPRWASINIiGITLCIB 
VRSLTLDTWB P FJjLKLMCELGNDVII^VYEANVEKMGI KKPQPG 
QRQEKEAYIRAKYVKRKFVDKIFTi*SLSPP\BQQKK\FVSKSSE 
EKRLS I SKFGP \GDQVRASAQSSVRS19DSGIQQS SDDGRES LPS 
TVSANS LYE PKGKRQDS SMFI1DSKHLNPGLQLYRASYEKHI1PKM 
AEAIJU^GADVNWANSEENKATPLIQAVLGGSLV^ 
VSQRDVQGRGPIJffiATWLGHTGQVCLFLKRG^ 
LS IAVEAANAD I VTLLRIJU?HNBBMRESEGLYGQ PGDETYQD I F 
RDFSQMASNNPBKLMRFQQDSQKF 


5840 


698 


3610 


XHLHLPRQHLTTLWQIS SPRWRS pQRAFMSALSKTQTQSAPAIiQ 
GLSSIiLQSVTGNPVPA5BAASQSTSASPANTTVYTI KGRRLPSS 
AQPFI PKSFN YS PNSSTSEVSSTSASKAS IGQSPGLPSTAFKL P 
SimCGFTATHNTSPAAPPTEVTICOSSEVSKPKL\ESESTSPSL 
\2MKIHNFLKGNPGFSVA* NLKHPNPAGSLGSS APSESHPSDFQ 
RGPTSTSIDNIDGTPVRDBRSGTPTQDEMMDKPTSSSVDTMSLL 
SKIISPGSSTPSSTRSPPPGRDESYPRELSNSVSTYRPFGLGSE 
SPYKQPSIXSMBRPSSIjKDSSQBKFYPDTSFQEDEDYRDFEYSGP 
PPSAMMNLQKKPAKSlI^SKI£DTTEYOPIIiSSYSfIRAQEFGV 

SPSKNDSFPTPDSNHNSI^QSTTGHLSLPQKQYPDSPHPVPHRS 
LFSPQNTLAAPTGHPPTSGVEKVIJVSTISTTSTIEFKNMLKNAS 
RKPSDDKHFGQAPSKGTPSDGVSLSNLTQPSLTATDQQQQBEHY 
RIETRVSSSCLI)LPDSTEBKGAPIBTLGYHSASNRRMSGEPIOT 
VES IRVPGKGl^GHGREASRVGWFDLS TSGSSFDNGP SS AS ELA 
SLGGGGSGGLTGFKTAPYKERAPQFQESVGSFRSNS FNSTFEHH 
UPPSPI^GTPFQRBPVGPSSAPPVPPKDHGGIFSRDAPTHLPS 
VDI*SNPFTKBAAIJUIAAPPPPPGE«^IPF^rPPPPP?PGEHSS 
SGGSGVPFSTPPPPPPPVDHSGWPFPAPPLAEHGVAGAVAVFP 
KDHSSLLQGTI^HFGVIJWPRDHGGP1X2RDI^GPGI^RV^ 
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Amino acid segment containing signal peptide 
(A=Alanine, C=: Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, F- Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K~Lyuine, 
J,=Leucine, M=Methionine, N-Aeparagine, 
P^Proline, Q^Glutamine, R*Arginine, 
S=>Serine, Threonine , V= Valine, 
W=Tryptophan, Y=Tyrosine, X-tJnknown, *=Stop 
Codon, /=*possible nucleotide deletion, 
\=possible nucleotide insertion) 








TI»PSHSLBHLGPPHGGGGGGGSNSSSGPPW3PSHRDTISRSGII 
LRSPRPDFRPREPPLSRDPFHSLKRPRPPPARGPPFFAPKRPFF 
PPRY 


5841 


1908 


762 


GLRLFLVLTVWPfWKPSWI^RTEraKRLIiCRTLWCQSGWSSRSY 
TRSMLKMTTSINRRSRTCTKSTRTSARPGLTATV5IGLSDSPTW 
imCHMTARSC^GEXGGHWAmQVGVYLLPGRVGCVSSRVSPSFP 
GIXSLDSGIARRGSAVSAIJ^LVEEPMLGPPFHPTPRFKAVSAK 
S KFD L VS QG FT E FT I ED FI INT FM DT_» T EQ VE KQ TS VADLL AC FND 
QSTSDYI»VVyiJ^TSGYI«RBSKFFKHPIEGGRTVKEFC2Q\QB 
\VEPMaCBSDHIHIlALAQGLQRVHP<3WBYMGPRPRAATTNPIII 
FP*GLP^PKVYIiLYRPG\HYDILYKIGLGSSPLGCPGCPLLARA 
LGHCYRGFSWVKWSYFTPPFLSHDPPPMFY 


5842 


307 


1918 


QBPTADFKLRSTOSCXJF^MTCPDKPGQLINWFICaUr^RVRKL 
HSSRRPRTRRNI.IiIiGTACAIYIiGFLVSQVGRASI^HGQAABKGP 

RS KPANTRGTVXPKRR KXHAVASAAPGQEALVGPSIjQPQBA\EG 
KLML*HLGTLREQTWLRLBSDPGGWC^^ 

RBSNIRIYSBSAPSWriSKDDrRRMRLLADSAVAGUiPVSSRSGA 
RLLVLEGGAPGAVLRCGPS PCGLLKQPLDMSEVFAFHLDR I I&h 
NRTIiPS VSRKABFIQIXSRPCPI ILWDASLSSASNDTHS SVKLTW 
GTYQQIJ^QKCWQNGRVPKPESGCTEIHHHBWSKMALFI)FIiLQI 
YNRLDTNCCG FR PRKEDACVQ^LRPKO^GSAALAH XIQRKH 

LRQKLLQSLFLDKGYWBSQGGRQGI EKLXDVIEHRAKILITYIN 
AHGVKVXjPMNK 


5843 


500 


1453 


GTARLVTCWVLHGQ*VKKPAWEPGVVWI,*Q*RCRPKGWGIiGAGM 
R33F^3QPPQCLRPJK?SSCCHFMVKLIJ)rXnTMIPGEKVAHTSL 

dalvtfhqqkpieprrelltqpcrqkdpanvdyedlfi.ysnava 
beaacpvsapeeaspkpvlchqsxerkpsaem/rqnnhqgshfl 
lppki ps wrdppeteiee pqnaprepjpegpaaakzpprhcblvvt 
lgcpeihgdlrpvtorkrqprslrgshi^qri.hgslcgiiisqkp 
ltapgtkrqkgphqegrevgqlh*gd prgqelapngs es p i jbpg 
vqarapglgra 


5844 


202 

• 


2471 


FDSAVLS S INVMAVIjPG PJjQLIiG VLIiTI SI>SS I Rkl QAGAY YG I 

KPLPPQIPPQMPPQIPQYQPLGQQVPHMPIiAKDG^ 

QYGKEYPHLP^YMKEIQPAPRMGKBAVPKKGKBIPLASLRGEQG 

PRGEPGPRGPPGPPGLPGHGI PGIKGKPGPQGYPGVGKPGMPGM 

PGKPGAMGMPGAKGEIGQKGBIGPMGIP * PQGPPGPHGLPGIGK 

PGGPGLPGQPGPKGDRGPKGLPGPQGIiRGPKGDKGFGM PGAPGV 

KGPPGbffiGPPGPVGLPGVGKPGVTGFPGP\QGPW5K\PGAPGEP 

GPQGPIGVPGVQGPPGIPGIGKPGQDG\IPGQPGFPGGKGEQGL 

PGLPG P PGLPG I GKPG FPGP KGDRGMGGVPGAUG PRGK KGPIGA 

PGIGGPPGEPGLPGIPGPMGPPGAIGFPGPKGBGGIVGPQGPPG 

PKGEPGLQGPPGKPGFIiGBVGPPGMRGFPGPIGPKGEHGQKGVP 

GLPGVPGLLGPKGEPGIPGDQGLQGPPGIPGIGGPSGPIGPPGI 

PGPKGEPGLPGPPGFPGIGKPGVAGI^GPPGKPGALGPQGQPGL 

PGPPGPPGPPGPPAVMPPTPPPOGEYIiPDMGLGIDGVXPPHAYG 

AKKGKNGGPAxBMPAFTAELTAPFpPVGAPVKFNKIiLYNGRONY 

NPQTGIFTCEVPGVYYFAYIIVHCKGGNVWV^ 

EYTCKGFTjDQASGSAVIiLLRPGDRVFLQMPSEOAAGLYAGQYVHS 

SFSGYLIiYPM 


5B45 


215 


2061 


HASNKSASLQD KMAN P KE KTAM CLVNELARFNRVQ PQYKLLN BR 

gpahskmfsvqi^lgeqtwesegssikkaqqavgnkaltesti«p 
kpi *kppksnvnnnpgcitpwe^glamkrg\kpaihrpldpk 
PFPNNRANYNFQVMYNQRYHCP I PKI fyvqltvgnneffgegkt 
rqaarhnaamkalqalqnepiperspq^^ 
bislvfeialkrnmpvsfevikbsgpp^kspvtrvsvgefsae 
gfjgnskkt^kkraattvt^elkklp plpvvekpk\nffkkrpkt 
ivkagpeygck5mnpisriaqiqqakkekepdyvi.lsergmprrr 

EFVMQVKVGNEVATGTGPNiCKIAiaCNAAEWlIM 
DQLEKTGBNKGWSGPKP<3FPEPTM!IT^ 
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Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, E=» 
Glutamic Acid, F-Phenyl alanine, G=Glycine, 
K^Histidine, Idsoleucine, Kakysine, 
T»~Trf=*iifr i tip- M— Mp»t-hH eini np N— TVorvnTicrinp 
P=Proline, Q=Glutamine , R-Arginine, 
S=Serine, T-Threouine , V=Valine, 
w=Tryptophan, Y»Tyrosine, X ^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NGTSSTAKAIGLKGSSPTPPCSPVQPS KQI»KYLARIQGFQVHYC 
DRQSGKECVTOjTLAPVQM-rFnAIGSS IEASHDQV* YATAILLC 
YGPARKWKAIKMEAMCAHAM.1*^ 
N- 


5846 


1126 


456 


FSKUCKKTPI IGISGVTNSGKTTLAKNLQKHIiPNCSVISQDDFF 

KPESEIETDKNGFLQYDVLEALNI^KMMSAISCW^ 

TDQ^AEEIPILIIEGFIiLFNYKPLDTIHNRSYFLTIPYHECKR 

RRSTRVYQPPDSPGYFIX5HVWPMYIJCYRQEWQm 

KSEBDLFLQVYEDLIQELAKQKCLQVTA* RRNTTNPS / CK* IRK 

TiQGVI 


5847 


2769 


505 


APEMEDLS SPDSTLLQGGHNLLSSAS FQESVTFKDVI VDFTQEE 
WKQLDPGQRDLFRDVTLEHYTHLVS IGLQVSKPDVISQLKQGTE 
PWIMBPSIPVGTO^WETRIiENSVSAPEPDISEEBLSPEVTVEK 
EKRDDSWSSNLLBSWBY^SI^QQANQQTLPKKIKVTElCriPS 
WBKGPVNNEFGKSVNVSSNLVTQEPSPEBTSTKRSIKQNSNPVK 
KEKSCKCNEOGKAFSYCSAIJCRHQRTHTGEKPYKCN* / CVEKAF 
S RSENLXNHQR IHTGDKPYKCDQCGKGFIEGPSLiTC^IORIHTGE 
KPYKC»ECGKA?SQRTHLVQHQRIinX3BKPYTCNEC!GKAFSQRG 
HFM EHQKIHTGEKPFKCDE CDKTFTRS THLTQHQKIHTGEKTYK 
C^ECGKAFNGPSTFIRHHMIHTGEKPYECNECXSKAFSQHSNLTa 
HQKTHTGEKPYBCAECGK3FSYWSSIiA(^LKIHTGEKPYKCNEC 
GKAFSYCSSLTQHRRIHTR^KPFECSECX5KAFSYLSNIiNQHQK^ 
HTQE KAYECKECGKAFIRSS S LAKHER IHTGEKPYQCHECGKTF 
* SYGSSLIQHRKIHTGKRPYKCNEQ3PJVFNUNIHLTQHKRIHTGA 
KPYECA3CGKAFRHCSSIAQHQKTHTEEKPYQCNKCEKTFSQSS 
HLTQHQRIHTGEKPYKCNECDKAFSRSTHLTQHQRIHTGBKPYK 
CNECGKXTFSQSTYLIQHQRIHSGEKPFGCNDCGKSFRYRSAZiN 
KHQRLHPGI 


5848 

• 


22 


2961 


AAPRRLIiRGGBGDRTPRFPLPAUjRPGPPAEAAPERRKMPAVSK 

GDGMRGIAVFISDIRNCKSKEAEXKRINKKLANIRSKFKGDKAL 

DGYSKJGCWCKIiLFrFLLGHDIDFGHMEAVNLLSSWRYTEKQIG 

YLFISVIjVNSNSELIRIjINNAIKNDIASR^ 

G3REMAEAFAGE I P KVLVAGDTMDSVKQ SAALCLLR L YRTS PDL 

VPraDTOSRVVHLLNlXS^ 

SLAVSRliS\RIVTSASTOLQDYTY*FCPGFI^ 

PPPT}PAVRGPOiTECLETIUnCAQEPPKSKKVQHSNAKHAVI»FEA 

ISLIIHHDSEPNI^VRACI^LGQFLOJiRETNLRYIALESMCT^ 

SSRFS HEAVKTHI ETVTNAIiK.Trt kDVS VRQRAVDTJ.YAMCDRSN 

A?Q IVAEMLSYLETADYS IREBI VLKVAII^AHKYAVDYTWX YVD 

TILNLIRIAGDWSEEVWYRVlQrTOlRDDVQGYAAKTVFEAI^ 

ArilUlBfl Ju V UVuvr I X LAjCjI: \jLi -L> i-^r\\jUtrt<-^J o i. LtJ.\JL 11 1 i j iH o RJ? It-Li 

CSVPTRAIXLSIVIKFVNI^BVKPTIQDW^ 

(X?RAVEYLRLSTVASTDlXATVLEBMPPFPERE5SIWyCL 

GPSTVTDLEDTKRDRSVBVNGGPEPAPASTSAVSTPS PSADLLG 

LGAAPPAPAGPPPSSGGSGLLVDVFSDSASWAPLAPGSEDNFA 

RFVCK^GVLFENQLLQIGLKSEFRQNLGRMFI FYGNETSTQFL 

NFTPTLICSDDI^PNl^IOTKPVDPTVEGGAQVOjQVVNIBCVSD 

FTEAPVmiQFRYGGTPQmrSVQLPlTLNKFFQPTEMASQDFFQ 

RWKQLSNl^QQSVO^IFKAKHPMDTEVTKTVKIx^ 

NPANFVGAG I IHTKTTQ I GCLLRLE PNLOAQWYRLTLRTS:<EAV 

SQRLCELLSAQF 


" §849 


3545 


1895 


KRRKIKETVFHHVAQAGLEUjSSSNPPSSASRSAGITGMRHQVQ 

P*DPCMSLSPPCFTBEDRFSLEALOTIHKQMDDDKDGGIBVEES 

DEFIRBDMOKllATNKHSHLHRRDEKITIEDLWKRWKTSEVH^ 

TLEDTLQWLIEFVELPQYEKNFRDNHVRtlTl'LPRIAVHEPSF 

SQUCISDP^H^QKLQLKAIiD\A^FGPLTRPPHN^KDFILTVST 

VICJVGGCWFAYTQNKTSKEHVAKh^^ 

LliKAQEI^RIWAVEKQNL+RKMMDElNYAKEKACRLRELRKGAH 
CRLSRRQYAEQEI^OVRMALKKABKEFKLRSSWSVPDALQKWIjQ 
LTHEVBVQYYNIKRQNAEMQIjAIAKDEABK^KKKRSTV 
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Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine, D=»Asparfcic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine , Isoleuc ine , K= Lysine , 
L=Leucine, N=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
SaSerine, T»Threonine , VWaline, 
W^Tryp t ophan , Y-Tyrosine, X=0nknawn, *aStop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








AHSSSLDEVDHKII^KKALSELTTajRERIiTRWQQIKKICGFQ 
IAHNSGLPSLTSSLYSDHSWVVMPRVSIPPYPIAGGVDDLDBDT 
PPrVSQFPGTMAKPPGSLARSSSLCRSRRSIVPSSPQPQRAQLA 
PHAPHPSHPRHPHHPQHTPHSLPSPDPDILSVSSCPAIiYRNEEE 
KKAIYFSAKKQWEVPPTASECDSLNSSIGRKQSPP/SKPRDIPN 
IIS/DBRYQBMRCP*RI PSGGIL 


5850 


3 


1895 


KAVIJ^SASGS VXSIiTGSNPMHDASMWHLKKNGI IVYLDVPLLN 
III CRIjKLMKTDRIVGOJISGTSMKD^ 

SGASPEBVADKV!^TKRY0m)SBTPIST!aWWPEDCEQKVSA 
EFFI EAV I EGIJ^DGGLFVPAKEFPKLS CGEWKS LVGATYVERA 
QILLERCIHPADI PAARLG KMI BTAYGENFACS KIAPVRKLSGN 
QFILELFHGPTGSFKDItSIiQLMPHI FAQCI PPSCNYM I LVAT3G 
DTGSAVI^FSRLNKNDKQRIAVVAFFPENGVSDFQKAQI igsq 
RENGWAVGVBSDFDFCQTAIKRIFiroSDFTXSFLTVEYGTI LSSA 
KS INWGRXiLPQWYHASAYLDLVSQGFTSFGS PVDVCI PTGNFG 
K ILAAVYAKMMGI P I RKFI CASNQ15MVHTOFI1CPG\HYDLRGKB 
N * AQTFFTVQ* I FL PNLSNLEimi^LMANKDGQLMTKLFNRLES 
QHHFQIEKALVEJCI^QQDFVADWCSEGECLAAIl^TYilTSGYIIiD 
PHTAVAKWADRVQDKTCP VI I SSTAHYS KFAPAIMQALKIKE I 
NF^SSSQLYLIX?SYNALPPUiEALLERTKQQEJ(MEYQVCAAI>lN 


5851 


3120 


1B02 


RCTLQFI^I^TSTSARAAAAIAAAEKPAGSPSVMTRAGDHNRQ 
RGCCGSLADYI*TSAKFIO»YLGHSLSTWGDRMWHFAVSV^ 
GNSLIxLTAVYGLWAGSVLVLGAI IGDWVDKNARLKVAQTSLW 
QNVSVIIiCG I ILMMVFLHKHELLTMYHGWVLTSCYILI ITIANI 
ANIiASTATAlTIQRDWIVVVAGEDRS 
LAPMAVG^IMTFGSPVIGCX3FISGWNIjVSMCVBYVIiLW 
PAIiAVKAGLKEEETEIJCQI^LHKDTEPKPLEGTHIiMGVKDSNIH 
BLBHEQEPTCASQMAEPFRTFRDGWVSYYNQPVF/liGWHGSCFP 
LYDCPGIj*IiHHHRVRIiHSGTEHFHPQYFDGS I SYNWNNGNCS FY 
LATSKMWFGS DRSDLRIGTAFI»FDIjVCDLCIHAW KP PGLVRFSF 


5852 


1 


422 


K'rrPPSSLCPLRQLPEVRGVSGQPLTDPLISLCRSHKCRGKGWG 
SSS YPSLPALLRARSAPGHCniRSCGPBWRIDSI SRLF^QGAim 

sgwaqaqptii^vprlrksi*psiwg/slmgpfitsgpg/wfrq 

YYPFISGRH* VI*FTBSDFY YVAMDFGOHGLSSHYS PGVPYYXQT 
BATSRIRRVVAGKKQ3VYFRRCX3GCSRAPPLITGGGVGSRKQRHP 
ESGAWAIAPGLPAIHGRSWES 


5853 


223 


1346 


RI^LSRVKGtHGPAASAWISDPETRGDPGGPWGW»RG6DLRPR 
FVSLTGLTI>VC^*AAQGPQV\HSVia^^IiGG\PCLL\FPIFRP 
LIIJIPPJIPRI^PGTRGVAVEPHAI^VVHVAHGEFJ^GIRAAGPGH 
G0VEIPQG/VGSLGARRGLRPSRPSSRHRNRVPAPPPGRPIATP 
HHRRFPPDPALTCPGI/^DQGPREQQKjQGSGRHin'ILGDWGESE 
SRWVRGNFRTGTAArLIGFSRNPTLNGSBNWGSLVS IQEEGPDT 
GKEREKRNPAEMGlJPQRWASPIHTPPIiGPBIU^PEALRAMPE 
ALGLRPDPATS VP SALS /QTF/ PESWPRSCLRNQGETLGMGPVP 
LSSLCITESPSQNWTPCLLLLTCPRGLF 


5854 


at, 


938 


KGRNTAP EKKGAALNNRENAS S * NGY / SRWKQDI RR I ENH 1 1 QE 

LXHLCAMIKRVLI^RLEimUCLRELTEGRT^^ 

RQI VTE YREKGKRN * EEKKRDLEGRSRRYNLCI IG I PETEDRAS 

GAETIKDLLE/ENFPBLKNBriDLQMEKAHR I PLKFNEKKAASRH 

IRVTFL/KFQRRWILQASSQRKQVTYKGAlCVRLTSDFSPAIIiNA 

RROW/N/PISRVLRRNNFEPRI IYSAKLSFLYKGNWKTFLDXQG 

LGKYINQELSLKrLLKDLLQLTENLK 


5855 


536 


2391 


LRSYGCKAPSRISHLHK\FXPL1JLPSLLMGYSBSPPPITDSWAP 
FISLTHHVLSQSQSPLSSNCWI CLSTHTQ* FTALPADLTiTWTQS 
NVSTinSYLAlPFIAD3FliKPV/L*PGKSAKHT^FKI^SIiSMVS 
GRAVALLHL I ASGLTS IQTNTASSKPPIWGY\L5TOrSFISPPP 
I^CLSRTYPl^AHATMVGO^TOSIiGGLIFTL/RTPCRPS ILHPNY 
KI ISTSAWQKVI^FSGSPTIHTSXjHLTTGSSFLSFHPIPGFPAA 
NS ALYVSSLKG PPGKNVTI PS PVTGT *Q PPHRGSN/ RLTVT)KDN 
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Amino acid segment containing signal peptide 
(A^Alanine, C-Cysteine, D^Aspartic Acid, E- 
Glutamic Acid, F= Phenyl alanine , G=Glycine, 
H-Histidine, I=Isoleucine, K?=Lysine , 
L=Leucine, M-Methionine, N-Asparagine , 
P=Proline, Q^Glutamine, R-Arginine, 
S-Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X»Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ffi^pkpnslholpsqVTPyqaltgaai^sypiwknentlswl 
ptpitnfci>stpslpflcdtn*yi j clpanwsgtctlvfqaptrn 

ILPPNQTILISVEAS ISSSPIRNKWALIILITLLTGLQ ITAAIiGT 
GIAGITTSITSYQTLFTTLSNTVEDMHTSITSLQRQLDFLVGVI 
LQNWRVIJ)LLTTEKGGTCI YLQEBCCFCVNKSGIVH IAVRRLHD 
RAAEL^HQVATJSWWQGSSLLRWIPWVAPFLGPLIFLFLLTWIGP 
CI FNbVSRFI SQRIiNCFIQASMQKHIDN I FHI/CHV+ YQSLRGNH 
SEAPKPRP 


5856 


173 


1137 


PWLHGLGLSAVPT,PYI>* /YVTFHLYGGI ILI*I»LIFISIAGILYK 
FQD\n^YFPHQPSSSRI*YVPMPTGIPHENIFIRTKDGIRIllI»II. 
IRYTGDNSPYSPT^lyFl^G^^NIGHRIlPNAbL^^VNLKVNnLL 
VDYltGYGKSBOBASEBGIj YIiDS£AVIiDYVMTS PDLDKTKI YLSG 
RSIG\GAAAIIILASDNSHRISAIMVEOTTFIjSIPHMASTLFSFFP 
MRYLPLWCYKNKFLSYRKISQCRMPSLFISGIiSDQLIPPVMKKQ 
LYELS PSRTKRLAI FPDGTHNDTWQCQG YFTALEQF IKEWKSH 
SPEKMAKTSSNVTII 


5857 


1597 


563 


KLIGKVLVI^WADAMAAFAVBPO^PAIjGSEP^IjGSPTSPKPG 
VNAQFLPGFLMGDLPAPVTPQPRS ISGPSVGVMEMRS PLLAGGS 
PPOPWPAHKDKSGAPPVRSIYDDlSSPGLGSTPIiTSRRQPNIS 
VMQS PLVGVTS TPGTGQSMFS PAS I GQPRKTTIiS PAQLDP FYTQ 
GDSLTSEDH\ LDDSWGDCI WGFLKAS A\S Y I WAQFAQ YGG IS * 
NMWMSNTGWWimiRYQSKLQ^ 

DKSVMKSSDRCALSSPSIiAFTPPIKTLGTPTQPGSTPRISTMRP 
LATAYKASTSD YQ VI S DRQTP KKDE SLVSKAMEYMFGW 


5858 




1419 


PPHQPAAASTSXriQQQQPPPPPQDSaKPWAQGPGPAPGVGSAP 
PASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTP 
P33GVPTTPPQAGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMP 
GGPKPGGGPGLSTPGGHPKPPHRGGGEPRGGRQHHPPYHQQHHQ 
GPPPGQPGGRSEEKISGPRRGFKA^r^SLI^RPGEKTTYTQRCRFC 
LLGIYLLISRPJflNSRRIiFAKIWENQEKFLSTKAKDSEFIICLESR 
ALA*NCPKPBI/5*YTP*GGRQIiPSSLFPTHACLPI£CSVIFSPF 
MFPQ*NCWGRfCP FIIPKLGPHLKGAVCNRWDDPWBGPTGKGHCIjN 
FAS 


5859 


307 


1503 


GGSSARPRASSRRMbSRKXTKNEVSKPAEVQGKYVKKETSPLLR 

NLMP3FIRHGPTIPRRTDICLPDSSPIOIFSTSGDGVVSRNQSFL 

RTTIQRTPHEI^RKSNRLSAPSYLARSIADVPREYGSSQSFVT 

EVSFAVBNGDSGSRYYYSDNFFDGQRKRPI^P^HEDYRYYEYN 

HDLFQRMPONQGRHASGIGRVAATSLGNLTNHGSEDLPLPPGWS 

VDWTMRGRKY Y IDHNTNTTHW SH PLBREGLPPGWKR VBSSE FGT 

YYVDirmKKAQY\RHPC7\PTCTSV*ST^SCni/AS/ROjQTERNQ 

SLLVPANP YHTAE I PDVnjQVYARAPVKYDHI LKWBLFQLiADIjDT 

YQGMUOiLFttK^BQIVKMYRAYRQAra 

HGKNF 


5860 


2956 


1270 


TIRVEEFPLCPGGGKAQI^SASLLGAGLI^PPTPPPLI*^ 
LLtiFSRLCGALAGPI IVEPHVTA\WGKirVSLKCl>IEVNETITQI 
S WEKIHGKS SQTVAVHHPQYG FS VQGE YQGRVLFKNYS LNDA7 I 
TLHNIGFSDSGKYICKAVTF^LGKAQSSTrVTV^ 
PDSIJBGGNETVAAICI AATGKPVAI1IDWKGDLGEMESTTTS FP 
NETATI I SQ YKLFPTR FARGRRITCWKHPALEKDIR YS FI LD I 
QYAPEVSVTGYIXaNWFVGRKGVl^KCNADA 
QWPIX^LIiASDim^FVHPLTFNYSGVYICKVTVNSPGSKEVTQK 
WPrFQDPSLPTYPPIiPALQFQWASPSTA*TS. : U)\LATBP*KIA 

psplstl\ati KGWTQLPTI IA* csgvgalfiv\lvkcfglg I F 
CYRRRRTPRGD YFAKNY I PPSDMQKESQ IDVLQQDBIiDP YFDS V 
KK^^^KNPVNNLIRKDYLEEPBKTOWNNVEKTiNRFERP^fDYYEDL 
KMGMKFVSDEHYDBNEDDLVSHVDGSVISRREWYV 


5861 


2051 


1305 


EV(^CVQAFmiVASSGDI>SGGGDKCGCBVGSWGSMRVVfWUU»L 
SEGEQGI PTACAAFAQQPAG/BPRRGLAGVGEGGPQCSVA7NYRC 
TLEFLVSLLGTDLARGRGNSASGPTAPADSKQL/ML*DVHRRVI 
I^+RMNSGSPA^I^APSQRFGTI^EGI^PGISPSWRKAIjYGCH 
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SEQ 
ID 

NO: 


ST JL CUXV^UCU 

beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal, peptxde 
(A-Alanine, C-Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Olycine, 
H»Histidine, I»Isoleucine, K«Lysine, 
L=I»eucine, M=Methionine , N=A3paragine , 
P=Proline, Q=Glutamine, R^Arginine, 
S -Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Onknown, *=Stop 
Codon, /=?possible nucleotide deletion, 
\ ^possible nucleotide insertion) 








A 


5862 


1556 


483 


PPFQLIMGBIKVSPDYNWFRGTVPLKKIIVDDDDSKIWSLYDAG 
PRSIRCPLIFLPPVSGTADVFFRQILALTGWGYRVIALQYPVYW 
imEPCDGFRKI.LDHlX>IJ5KVHLFGASLGGP^ 
RVHSLILCNSFSDTS IFWQTWTANSFWI^PAFMIjaaVLGNPSS 
GPVDPMMADAI DFMVDRLESLGQSELASRLTTjNCQNSYVEPH|CI 
RD I PVTIMDVFDQS ALS TEAKEKM YKLYPNARRAHLKTGGNF P Y 
IXRSAEVNLYVQIHL/R/PJIS^PNTRPIjTHQWSVPRSIJICRKA 
ALASARRSS3VSLAVNDRT,TRCVLV*SVASAPVSRPFPSGSSGS 
PVLTVSGK 


5863 


2714 


249 


PFPSRGSIiPI^PREirn^PLM>rtiFCI>I,FLYPGlJU^PSCP^ 

VNISGGTFTI^HGWAPGSLLTYSCPQG^YPSPASRIiCKSSGQWQ 

TPGATRSLS KAVCKPVRCPAP VSFENG I YTPRI/5S YPVGGNVSF 

ECKDGFl\LRGSPVRQCRPNGMWDGBTAVCDNGAGHCPNPGISIi 

GP\VRTX5FRFGHGDKVRYRC^SNLVLTGSSERECQGNC3VWSGTE 

PI CRQPYS YDFPBD VAPAIiGTS FSHMIiGATNP1X?KTKBSI/5RKI 

QIQR3GHLNLYLLLDCSQSVSENDFLI FKESASLMVDRIFSF3I 

NVSVAIITFASEPKVTWSVT,NmSRDMTEVlSSLKNANYKDHKN 

GTGTimAALNSVYIjMMmQMRIJiGMBTMAW 

DGK\SKWGGSPKTAVDHIR£IIjM 

DWRBLN2LGSKKDGRRHAFI LODTKALHQVFEHMLDVSKLTDTI 
CEVGNMSANASDQERTP WHVT IKPKSQET\ C\RGAI*I SDQWVLT 
AAHCFRDGNDHSLWRVNVGD PKSQWG KE FLIEKAVI SPGFD VFA. 
KKNQG IL\EFYGD\DI ALL\ KIiAQKVKM\3THCQGPSCIiP\ CTM 
\RANLGFLRKTFKGSTCR\DHENEL/VWNKQSV\PAHF\VAIi\N 
GSKLKHJjTLRMGVEWTS CCRGLS PKKKTM\FPNI»T\DVRB\ VVT 
D \ OFJj\C5 \GPQKUBoF A Ljv w is \£>G^^ 

SWGL\YNP\CT-GSA\DKNS PKKGPSVAKVPPPTR/DFHIN\I*FP 
□♦SPl^QlII^MS^lFXPLIJUJGm^SPFACPARICRPIiKFLPS 
BWATLRTL 


$864 


173 


1013 


PlflSVPQStjISIiPQPliCFPGGQEPSAPSPCLYSFl^WA^FTMG 
KLPPS I PPSSPIiAjCVLKNI*KPLQLTPDLKPKCI»IFFCNTAWPQY 
KIiDHDSK* PBNGTFBFSII^VLJMiSCHKMGKKSEVPDVQAFF\S 
HWSLPSLCSQC/GLIPNLSSFSPFCSFG/PPPQVPSP/TRSFFS 
MDSSDLPPSPQAAPRQAEPGPHSHLASAPPPYNPFITSPPHTWS 
duyrnov i.%»rifM:jrfvj%£E i wui.v Jtumvv« v ivvonrr ouoyui itu 
GSFSSNI KIQPSSWLI WQQP 


5865 


568 


1684 


CLPGPRWGBGWRAGHTIVGCI FFKTAI ZSHFKGGMYLCVCMCTC 
LSVCVCVQVGSWI CV/CVSMCACVSLCTC\ICRCISMYTRBHAC 
ACTRV*VYMCMS/VCTCT/STCIDVRVCAHVCVYMC1^ 
TCV*MCVCMHEHVCMC/VC^CSCVLL/CRGHICM/MCMSAYICI 
/LVYVCVLCVWAmJWSTCVWLVYG*ACTCVWMHM CSCTCR/C 
VHVCCMSMHACBCLCVYIiHI CGCAGTRRWWAGSARGSRSCSRLP 
CHAPGPGLSLPGPSCPSVEQGLGGGPGQIvQGRSGBARZ^EHRGW 
GSPAAVCSRNCTVS PRRGAIX!FEAPDVP KQPPGWGRASFEBRGC 
GGRGWVCAPPIJ^PQCCXZFSI KPELKAKKKK 


5866 


98 


3197 


ARPBWAPPAWLSRRGAAKMGDKJCDDKDSPKKNKGKERRDLDDL 
KKBVAHTEHKMS VEKVCRKYNTDCVQGLTHSK^ 
LTOPPTTPBWKFCRQLFGGFSII^IGAILCFrAYGIO^TED 
DPSGDMI^YliGIVLAAVVX ITGCFS YYQEAKSSKIMESFKNMVPQ 
QALVIREGEKMQVNABEVWGDLVEIKGGDRVPADLRI ISAHGC 
KVDNSSLTGBSEPQTRS PDCTHB\NPUCrRNITFF3KNFVEGTA 
RG^AA/ATGDRTVMGRIATLASGIjEVGKTP lAIBlBHFIQLITGV 
AVFLGVSFFILSLILGYTWLEAVIFLIGI IVANVPEGUATVTV 
CLTLTAKRMARKNCLVKNLFJ\VBTIGS*rSTI(^DKTGTIiTONRM 
T^toHMWFDNQniRAlHTEDQSGTSF^ 

NRPVFKGGQDNIPVLKRDVAGDASESALLKCIBLSSGSVKLMRE 
RHKKVAK I P FNSTNKYQI^ IRF/TKDPOTNRY1XVM1(GAPER ILD 
RCSTI LLQGKBQPliDEE MKEAFQNAYLHLGGLGERVIX3PCHYYIi 
PBP^FPKGFAFDCDDVNFTTDNLCFVGI»MSMIGPPRAAVPnAVG 
KCRSAGIKVXMVTGDHPIXAKAIAKGVGI IFBGNETVEDIAARIi 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
aeguence 


Amino acid segment containing signal peptide 
(A^ Alanine, O Cysteine, D*=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I~Isoleucine, K=Lyeine, 
Ii=Leucine, M=Methionine, N~ ABparag inc , 
P- Proline, Q^Glutaraine, R^Arginine , 
SwSerine, T«Threonine, V=»Valine, 
W=Tryptophan, Y=Tyrosine, X»Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NIPVSQVNPRnAKACVIHGTDLKDFTSEQIDEILQNHTEIVFAR 
TSPQQKLI IVEX5CQRQGAIVAVTGDGVNDSPALKKADIGVAMGI 
AGSDVSKQAADMItLDDNFAS TVTGVEEGRliIFDNLKKSIAYTL 
TSNIPB IT^PIiFIMRNIPLPLGTITILCIDI/STWWPAXSIiAY 
EAABSD I MKRQPRNPRTDKLVNBRIi IS MAYGQIGM IQALGGFFS 
YFTIIA^GFLPGNLVGIRLNWDDR'nfNDLEDSYGQ^WTYEQRK 
VVEFTCHTAFFVSIVVVQWADLI I CKTRSNSVFQQGMKNKIL I F 
GLF3ETALAAFI*SYCPGMDVALRM YPLKPSWWFCA^ 
YDEIRKLILRRNPGGWVBKETYY 


5867 


3 


1485 


LPGRRARGGRGIiGWPPAQAUX?SRMGKAKVPASKRAPSS?VAKP 

vj Jtr V IV1 Li _i xCIVJvJI nXMVJUC Jr Fl n£> n/VKA v AAJ^Jr/KH7krs3AV VrCr^ IV/Urb 

DPSQNWKAU2EWIiI,KQKSQAPEKPLVISQMGSiaCKPKI IQQNKK 
BTS PQ VKGEKMPAGKDQELRSRGS VPSGS KMDRRAP VPRTKASGT 
BHNKBCGTKERTNGOIVPERGDIEHKKRKAK\GQPQPHPPR/lDI 
HFDDVDPADIEAAIGPKAAKIARKQLGQSEGSVSLSIiVKKQAFG 
GLTR ALALBCKMVG VGPKGEBSMAAR VS 1 VWQ YGKCVYDKYVKP 
TEPVTDYRTAVSGIRPKNLKQGKELEVVQKEVABMLKGRILVGH 

I LGI£V0^}AEHCS IQDAQAAMRLYVM VKKEW 

PDHCSDDA*QSCPAAAAAPLQROCDQSQGQITSPQSGNSGETFS 

ESHQRGVAWCY 


S868 


2122 


833 


LTAG^HTQDASQSa^AKYPAAAONL/CVTNAMREDUlDIWYlR 
AVTVYDKPASFPKBTPIJiljO^iRLFMKIiGSMHSPPRARSEPED 
l^SAFTERDAGSGL VTRLRERPALLVS STS WTEDEDFS I LIAA 

TtR^IW+TXM^.TY^MNT.PKT.VrVTTf^KnPT.RPVVQPT.TWnTrHKYTTI 
jluj^jk. v x \njtJjtA3rx3nxt«roJuv^vx x vwviriJKCX x uAuxnUAJiryti 

IQVCTPWLEAEDYPLLIjGSADIXJVCLHTSS 

CCLPVCAVNFKCLHELVKHEE^I^VFBI^EEI^QLQMLFSNFP 

DPAGKliHQFRKNIiRESOXJI»RHDESWVOT^PLVI^ 1 


5869 


2122 


^833 


IiTAGASHTQDASQSTSAKYPAAAQNL/ CVTKAMREDLADIWY IR 
AVT™KPASFFKETPIJ>U}HRLFiraX3SMRSPFRARSRPEDPV 
TERSAFTERDAGSGLVTRIjRERPALLVBSTSWTEDEDFSILLAA 

IQVCTPWiKAKDYPLI^SAl)IWa^SSSGLDLPMKVVDM 

CCLPVX^VNFKCIJIBIjVKHEENGLVFBDSEELA^ 

DPAGKI^QFRKNIJ^QQIAWDBSWVCym^VMDT 


5870 


2122 


833 


LTAGASHT0J3ASQSTSAKYPAAAQNIi/ cvtnamrkdladiwyir 

AVTVYDKPASFFKETPIjDIXJHRIaFMKLiGSMHSPFRARSEPEI>PV 

TERSAFTERIW5SGLVTRLRERPALLVSSTSWTEDEDFSI LLAA 

I>ESRV+T\MTl^HNLPSIArCVITGKGPI^^ 

IQVCTPWLBABDYPLI^SAIJLaVCLHTSSSaLDLPMKVVDMFG 

CCLPVC^VNF1CCLHEL,VKHEENGLVFEDSEBI^ 

DPAGKI^QFRKNLRESCWLRWDBSWVOTVLPLVMDT 


5871 


3 


3465 


FPFC^PLRLYSKTTGDRSAMAGAAGI>TAE\^WKVLERRARTIG5S~ 

VLKLL*I^LRRI,*I 1 KPTI*l^LLT*CSRIiSVFRFLKV\GSVYEP 

LKSINLPRPDNETLWDKIJDHYYRIVKSTLIXYQSPTTGLFPTKT 

CGGDQKAKIQDSLYCAAGAWAIJUAYRRlDDDKGRTHEIiEHSAI 

KC!MRGILYCYMROJ^DK\^X}FKQDPRPTTCUlSVFNVHTGDELI^ 

YEBYGHLQI MAV3 LYLJjYLVEM I SSGLQ I IYNTDEVSFIQNLVF 

CV\ERVYRVP\DFG\VWGKREGKYY*/SGSTELHSSSVGLGKRQ 

L* KQFNGFNLFGtlQGCSWSVI FVDLDAHWINRQTLCSLLPRESR 

SK^rnAALLPCISYPAFAIJ)DEVLFSQTIJ)KVVRKLKGKYGFKR 

FLRDGYRTSLEDPNRCYYKPAE IKLFDG I ECEFPI FFLYMMIDG 

VFRGNPKQVQBYQDLLTPVLHHTTEGYPWPKYYYVPADFVEYE 

KNNPGSQKRFPSNCGRDQKLFIjWGQAL YI IAKTiLADELIS PKDI 

DPVQRYVPIJaDQRNVSMRFflNQGPIiENDLVVHVALIAESORLQV 

FLNTYGIOTQTPCXJVEPIQIWPOQELVKAYLQEiGINEKlXiLSGR 

PDRPIGCLGTSKIYRILGICTWCYPI I FDI£DFYMSQDVFLLID 

D I KNALQFI KQYWKMRGR PLFLVI.IREDNIRGSRFNP ILDMLAA 

LKJ(GIIGGVKVHVDRIjQTIiISGAVVEQLDFLRISDTEBIiPEFKS 

FEELEPPKHS KVKRQS STPSAPElfGQQPDVNISBWKDKPTHKIL 

qkw?ixsciasoaxi^iixkregpnfitkegtv^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide " 
location 
corresponding 
to first 
amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
ani'l no acid 

sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, K» 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
I>= Leucine, M=Methionine, N=JVsparagine , 
P«Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine , V=»Valine, 
WdT^vnhoTjhiAn . Yr=TvT"o«*ine 3t=tlTnltnowr» . *=Sfccm 

Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








AGSQKLW5 WRRAASLLS KWDS LAPS I TNVLVQGKQVTLG AFG 
EEEEVlSWPJUSPRVIQNIIYYKCNTlffiERKAVIQQELVIHIGWI 

SEVKQIJLIJDI1X}PC&NGRCWLNR^^ 

LERTPNGI I VAGKHLPO^PTLSDMTMYEMNFSIjLVEDT^ 

PQYRQIVVKLW1WSIVLEI^PBLEFQDKVDIJ)RLVKEAFNEFQ 

KDQSRLKEI EKQDDMTSFYNTPPIX5KRGTCSyLTKAVMNLLLEG 

EVKPNNDDPCL I S 


5872 


68 


665 


v0gyt4yrfvikinscysektsicrhr(xpelpatqpwptptvpp 
nia1ds eslgci \sfklfadkv/pkrwkknfvllntgekvlgdk 
gpcpyriipgVlcqggdpthhngtggkslyskefddenfi /lkh 

TAPGVI^TANAGPTTNGSQFFICTAKTEDG*QHV^ 
IVEALERSGSRNGKTSKKITAANCGQI* 


5873 


2240 


506 


RRPPEGGSGGGRRTRARMPLPHSLALPI^IjSWVAGGFGNAASAR 
HHGIJ»ASARQPGVCHYGTKIACC^^ 

gbc^gpnkcrcfpgytgktcsqdvnec^kppj>cq^ 

ykcfci>sc3imiiMpdatcwsrtcamihcx}ys 

ssglrliapngrdci^idecasgk^cpynrrcvntfgsyyc^ 

rGPBLQYXSGRYDCIDINBCIWSHTCSHHANCFNTQGSFKCKC 

kqgykgnglrcsai pens vkevlrapgti kdri kkllahknsmk 
kkakiluwtpbptrtftpkvniiqpfnyeeivsrggnshggxkkg 
neekmkbglbdekreekalkd *hrrerpfkg\ dvffpkvneage 
fgltlXvorkaltsklkhkadlni svdcsfnhg\ icdw\kqdr\ 
eddfdw\npadr\dnai \gfy\mavpglwqghk\kdigrlklll 
pdi^pqsnfcllfdypj^gdkvgki^vfvknsnnalawekttse 

DEKKKTGKIQLYQGTDATKSI IFFJu^RGKGKTGEIAVDGVLLVS 
GLCPDSLLSVDD 


5874~ 


2 


3387 


ACPRliARRRRRVRSLRPJRRGHLRARWSRGQNNMAARR^TOET 
AVTX2EKAKRYHMDASGEAVSETLOFKAQDLLRAVPRSRAEMYDD 
VHSIX5RYSLSGSVAHSRDAGRESLRSUVPSGPSFRSSNPSISDD 
SYFPvKECGRDLEFSHSNSRDQVIGHPJCuGHFRSQDWKFALRGSW 
EQDFGH PVSQES SWSQBYS FGP SAVLGDFGSSRL I E KECLEKE \ 
SRDYDVDHSG\EA\DSVLRG8\SQV0A\RGRALNIVDQEGSLLG 
. KGETCX^I^TAKGGVGKLWIJWVSTKKI PTVNR I TPKTQGTNQ I 
QKNTPS PDVTLGTNPGlin)IQFPIQKI PLGLDIiKNLRLPRRKMS 
FDIIDKSDVFSRFGI BIIKWAGFHT1KDDIKFSQLFQTLFELBT 
ETCAlQMLASFKCSiaCPEHRDFCFFTIBTIiKHS 
NMLLDKGAVKTKNCFFEI I KPFDKY IMRLQDRLLKS VTPLLMAC 
NAYBI^SVKMKTLSNPLDIJu^ALST^ 

5FRQEKIL*AVGI^DIAPSPAAFPNFEDSTLFGI^XDHLKAWL 
VS SGCPLQVKKAEPEPMREEEKMI PPTKPE IQAKAPSSLSDAVP 
QRADHR WGTICQLVKRVI EGSL3 PKERTLLKEDPA YWFLS DEN 
SLBYKYYKLKIiABMQRMSBm»RG^KPTSADCa 

PT^.KTna.T,P\^PRQT.T.PA<^\T.Bn\WirARWft\'l"l'<-r^f'I.T.PT,P 

apglkhhgrqapgls \qakps lpdrnd\aakd\cpldpv\gpsp 
qd psleas gp3pkpagvdiseapqtss pcps adidmkdngrtae 
klarfvaqvg\peieqf\si\enstdnpdlwfl\hdqnss\afk: 
fy\rkkvfelcpsicft^phnl\htgggdtt\gsqbspvdlmb 
graefedeppprbaelespevmpebededdkdggbbapa\pgrg 
gps legst padglpgea\ aeddl/ algapalftqllqvtcfp fg 
RGFSSKSXiKVGMIPAPKRVCLI qepkvhepvri aydrprgrpks 

KKKKPKDL0FAO^KI>\TDK\RLGFQ\MLQKMGWKEGHGIX5SIX9K 

gxrXsrsactqqaawggsg^lspstcslpt^gsftakmayswql 

IFVF 


5875 


296 


1848 


LAALGGLPtWRI^RRGFRBYIiGLSAPSALGGAMR^ 

I^PSGSLFPHAICI^DVDNDTLNBLVVGDTSG 

WLTCSOJGMLTCVGVGDVCNKGKI^VAVSAKGWFHLFDLTPAK 

VLDASGHHBTLIGEEQRPVFKQHI PAOTKVMLI ^ 

VVGYTDRWRAFRWEEI^EGPEHLTGQLVSLKKWMLEGQVDSLS 

VTLGPLG^ELMVSXlPGCAYAILLCTmCI^^ 

/SGDPSCPRRGAAPDIWPYPQQECLHSPNWQHQT\SHGa'KSSGS 
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SBQ 
ID 
NOr 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A- Alanine, o=Cysteine, EteAspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=*Glycine, 
HcHistidine, I=*lsoleucine, K^fcysine, 
jj=ljeuc j.ne , .i^nfi z on x nt* t v* ~j\op*x tciy xuxz , 
P=Proline, Q=Glut amine , R-Argiiiine, 
S^Serine, ^Threonine, v«Valine, 
W-Tryptophan, Y-Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=>po3sible nucleotide insertion) 








GLFAI»CTI*DGTbKIJ*BEMEEA^ 

NGHE EWAvJA W UGvFi Y 1 J- UHN K. 1 yVKifyjVUtJilHH^\^JV^UJ.HL,t\. 

HGRHSPCLVYVTFNQKIY\n(WEVQLBRM^ 
TACCRSWAWILTTSL*IiVPCFTKRSTIQTSHHSVLPQASIiI PPS 
KTCLIAGBGPF*TPTLPPHGVFGSHCAAAGSITKQ 


SB76 


1122 


224 


HLPLGVPSKVAGAAAMKPQEKRETQVAAHIjKKIFGDHPIPQYBV 

KPRTTBIIiHHLSERNRVRDRDVYLVIKDLKQKASBYESEAKYIjQ 

DLLMES VNFSPANLS STGSRYIiNAL VDSAVALETKDTS IAS P I P 

AVNDLTSDLFRTKSKSEKIKlEI^ia^KHLTATLVLEKCLQEDV 

KKAELHLSTBR\AKVDNRRC^W\DFTjKAKSEEFRFGIQAA 

SARGQ\DAFSVPlQSLVAI,IRBNWPRLKQOTIPUK\KKIiESYt.D 

LMP\NPSHCSK*RIEBAK\REIiA\SIEAELTRRVS\MMEIi 


5877 


2030 


1907 


GTl^KMAASS3GgKEKERLG<X5I*GVAGGN^ 

LSRBLIEMIiAISRNQKJjLOJ^EBNQVIiBMjIHRI^ 

LN(K?KIHHKMQVLKKBVKK3?DSDI(XiIiQKQIJCBAEQILATAVYQ 

AKEKL KS1 EKARKGA I SSEE I I KYAHR ISASNAVCAPLTWVPGD 

PRRPYPTDLFJtiRSGIjIXXSMNNPSTNGV^ 

CTCST\re/NGS0>1TCR*INIILII^KSVCBL 


5878 


950 


2113 


GLWKCMQLQGPHTHRVQP* PTPRQQGPQ \ VPVAVIAGNRPNYLY 

RMU^LLSAQGVSPQMITVFIDGYyBBPWDVV^ 

IS IKNARVSQHYKASIiTATFlOjFPEAKFAVVLBEDIiDIAVDFKS 

F1^0^IB1aLBEDD3LYCISAWNDCX3Y3HTABDP 

DGWVI^RSLYKEKl^PKWPlTEiCLWDWDMWP™ 

PBVSRSYHFGIVGLNMNGYFHEAYF1G<HKFbnVPG^ 

KXRAYFAJEVHRLIiSEABVIjDHSKN^ 

MB KDDD FTTWTQIJ^CIiHI WDLDVRGNHRGLWRLPRKKbHIFlj^ 
GVPASPYSVKKPPSVTPIFLEPPPKEBGAPGAPEQT 


5879 


3 


981 


RLtBAAAAGSGSRAAGWAGSPPTUiPLSPTSPRCAATMASSDED 
GTNGGASEAGEDRBAPGKRRRIiaFIATAV^TFYDIAMTAGWIjVL 
AIAMVRFYMBKGTHRGIiYKS IQKTLKFFXJfTPAIJjB IVHCX IG I V 
PTSVI VTGVQVSSRIFMVWI.ITHS IKPIQNBBSVVLFLVAWTVT 
HITRYS FYTFSIJiDHLP YFI KWARYNFFI ILYPVGVAGBLLT I Y 
AALPHVXKK5MFS IRIjPNKYNVSFDYYYFLLITMASY1PLFPQI> 
YFHMLRQRRKVI*HG\G*L* KRMIK* SLQlRCFFQNWOjDYLS PS F 
NNKNKQLCEISWIVWFLKI 


5880 


1138 


1324 


SLWCLVAGGLGLGPSSONPLQRAGIIJ^PREARGTPSAIiTACSA 

SVTSKGKSSSG^PSAASDRJ>SPVPIxRPPGPVC^ 

* KKKRGRCSS / WLSQPQHBREKEWLLRRSMAEGERARAASDVL 

CRSXiANETHQIjJUnTiTAra f 

D0SBHTDGHTSVQSV1 EKI^BENRI&KQKVTHVEDLNAKKQRYN 

ASRDEYVHGLHAQLRGLQIPHEPELMRKEISRI^RQLEEKINDC 

AEVKQELAASRTARDAAIiBRVQMLBQQI IAYKDDFMSERADRER 

AQSRlQEtEBKVASLLHQVSWQI^RBPI^RIHAGSKTAKYIJV 

ADAfcBLMVPGGKRPGTGSQQPEPPAEGGHPGAAQR^ 

CLGCFSDEGGEELLRHVABCCQ 


5881 


26 


441 


GG1 EPSPTEAPRAQHLTMDCTWRILFIiVAAATGTHAQVQIXQSG 
SF^KKPGA5VMVSCYVSGYTLTKLSMHWVRQAPG 
I^IWETIYPQKFOGRVSMTEBTSTETTQ/ AYLELSSIjRSEDTAV 
HHCATDTV 


5882 


2407 


2216 


SGCVEWT »YSHSI iBYNPBW ISVQSAVAPAQUVIHSDGDIi* IiHSGB 
RTRRD*QJL.PEAGGPGLQEPLQLGEIJ>ITSDEFILDEVDG\VDLR 
HYSKQVBLELQQIBQKS IRDYIQB5ENIASLHNQI TACDAVLER 
MEQMLGAFQSDLSS ISSEIRTLQEQSGAMNIRLRNRQAVRGKLG 
ELVDGLW PS ALVTAI LEAPVTE PRFI^QLQELDAKAAAVREQE 
ARGTAACADVRGVIJ)RLRVKAVTKIRBFILQKIYS FRKPI^fTNYQ 
I PQTALLKYRFFYQFLLGNBRATAKEIRDEYVETLSKIYLSYYR 
S YLGRIWKVQYEBVAEKDDLI^GVEDTAiQQSFFSKPSlASR^ 1 F 
TLGTRGS VI SPTBLEAP ILVPHTAQRGEQRYPFEAL FRSQHYAL 
IJ)NSCREYLFICEFFWSGPAAEaiLFHAVMGRTl^ 
LADCYBAIAVFIXIIHlVLRFRltfXAAKRW 
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SBQ 
ID 

NO: 
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beginning 
nucleotide 
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correspond! ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F= Phenyl alanine, GaGlycine, 
H=Histidine, I=Isoleucine, K«Lysine, 
L=Leucine, M=Methionine, N-Aeparagine , 
P=Proline, 0=Glutaaine, R-Arginine , 
S=Serine, T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Gnknown, *^Stop 
Codon, /possible nucleotide deletion, 
\=po3sible nucleotide insertion) 








PRPKLIi.EMNVQSVRSTDPQRIiGGLirrRPHYITRRYAEFSSALV 
SINQTI PNERTMQLLGQLQVEVKNFVLRVAAB FSSRKEGLVFLI 
NN YDMMLGVLM\E * ERAADDSKKVBSFQQLIiNARTQEFIEKLLS 
PPFGGLVAFVKEAEAIJBRGO^ERLRGEEARVTQLIRGFGSSWK 
SSVES LSQDVMRSPTNFRNGTSI I QGALTQLIQ\LYHRFHRV\L 
SQPQLRALPARAELINIHHLMVELKKHKPNF 


5883 


2 


1374 


BFPGRRFl^VMBAGAGAGAGAAGWSCPGPGPTVra 

CERKKGQRWGSLERRGMQAMEGEVLIiPALYEEBBEEBBEEEBVB 

EEEEQVQKGGS VGS LS^7NKHRGLSIiTErrBLEELRAQVI^3LVABI* 

EETRBIjAGQHEI)DSLBIjQGI»IiSDERIJVSA<X)AEVFTKQIQQLQG 

ELRSLREE ISLI»EHBKESBLKBI EQEIiHLAQAE IQSLRQAAEDS 

ArBHBSDIASl^EDLCRMO^JEI^BDMKRIRGDYEME IASLRAEME 

MKSSEPSGSliGLSDYSGIiQEEIiQEIiRBRYR^LNEEYRALQESlIS 

SIiTGQLADnESERaX>RATERWLQSQTI*SMTSAESQTSEMDFLEP 

DPH>lQTJ 1 RQQIiRDAKEQMHGMKNKrOEI J CCELERT 1 QHHRQVr > BE 

EQRI2IX3RELKCAQNKVIJ^FQTSHS\SPSHPLPPIPPSSPCLL*A 

LWISALLWCWWAETSS 


58B4 


4261 


2522 


GVIARASAIUiRVPLTGVRAC^PEVGAEPAKVAGAAKPDE^ 
SRLRDCGDYTPSERIiGPKGAMLWFQGAI PAAI ATAKRSGAVF VV 
FYAGDDEQSTQWiASWEDDKYTEASSNSFVAIKIDTKSKACXQF 
SO I YPWCVPSSFFIGDSGI PriKVTAGSVR AT>RT.*UTP THJCVPAM 
EIJj.KSKTS VANG SOS KS S VST PSAS FK PNNTCENS QS RNAKT , CE 
IPSTSDTKSDTAIGGESAGHATSSQEPSGC^ 
RLTKKT.EBRREEKKKEEEQRB 1 KKE I ERRKTGKEMLDYKRKQEB 
B LTKRMLE BRNRBKABDRAARER I KQQ IALDRAERAARFAKTKE 
BVEAAKAAAIJiAKQABWEVKRESYAREKST 
TNQFPSDAPLEBARQFAAQTVGNTYGNFS latmfprreftkedy 
KKKLI^LBIJ^SASVVIiI^/ABFIMF*AGRPTASIVH^ 
TLLGTVLYPFTjAI WRIjI SNFLFSNPPPTQTSVRVTSSEPPMPAS 
SSKEEKREPWKRVLEK^GDDFKKF^KIYRiRTQDDGEDENNTW 
NGNSTQQM 


5085 


900 


467 


A^GGGRRSRLSRSWPTGPSKSPSGVRCCG\RR\AWEDKDBFIiDV 
IYWFRQI IAVVLGV I WGVIj PIJIGFLG IAGFCLINAGVLYL YFSN 
YLQIDEEEYGGTWBLTKEGPMTSFA/ I VHGHLDHIiLHCH PL * LM 
VYSSQVLPIQ3KGPS 


S88G 


85 


1341 


PFRGRAbTLICKQPRPGVAPPSLGTCHKSDPGRPAAQSOPPSPGS 

GTFGLLSFRfWRTKTWTI*IO^FVGYPTNSDFEI>KTSEIiPPLKNG 

BVMiEA^FLTVDPYMRVAAKRLKBGDT^^ 

PKGTIVLASPGWTTHSISIXJKDLEia^TEWPDTIPLSLAI^ 

MPGLXAYFGLLE ICX^GGETVMVNAAAGAVGS WGQIAKLKGC 

KVVGAVGSDEKVAYLQKI^PDWFNYKTl^ 

IX^fFDI^GEFSiriVIGO^KFGRIAICX^ISTYNRTGPLPPGP 

PPBIGIYQBljRMFJVFVVYRWQGDARQKAIiKIJLIiKVW 

D*IX3ANTLVYKSMKSAKPSI^ISEKLVSG\KIQYKEYIIEGFE 

NMPAAFMGMLKGDNLGKTIVXA 


5887 


1937 


104 


APGCRGCRATRCPCRGPRWD5I^BAARSPAAPGaAPGLLGIiRB 
RPDRCHPGGDDRGPQLHRGSPG/SPSBLSRRPGPPGLPGLQGPP 
PAPGLPQSRTL/PVLCVCDLS PAQCDINCCCDPDCSSVDFSVFS 
ACSVPVVTGDSQFCSQKAVIYSLNFTANPPQRVFELVDQINPSI 
FCIHITO V *NkHYPLLIQIOrb/NBNNF 

VSFTTKI^r PTAAKYEYGVPlOTSDSFIJiFPSSLTSSLCTONNP 
AAFLVNQAVKCTRKI NLEQCBEIEALSMAFYSS PK I LRVPDSRK 
K^IWQSIVIQSLNKTLTRREDTDVLQPTLVNAGHFSLCVNVV 
LEVKYS LTYTDAGBVTKADLSFVLGTVS S VVVPLQQKFE IHFLQ 
E^PVPLSGNPGYVVGLPLAAGFQPHKGS^IIQTTNRYGQLTI 
LE^TTEQDCLALEGVRTP VL FGYTMQSGCKT41LTGALP CQLVAQ 
KVKSLLWGOGFPDYVAPFGNSQGP/AD>fLDWVPIHPITQSFKRK 
DSCQliPGALVTEVKJfnCYGSLLN 

S GNERTI LIS TAVTFVDVS APAEAG FRAPPAXNARLP FNFFFPF 
V 
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ID 
NO: 
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beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid' 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C- Cysteine, D=Aspartic Acid, B=/< ;. , 
Glutamic Acid, F= Phenylalanine, G=*Glycine, ' 
H=Hietidine, I=Isoleucine, K»I*ysine, 
L=Leucine f M^Methionine, N^Asparagine , 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine r 
W-Tryptophan, Y=Tyxosine, X=Unknown, *=Stop 
Codon, /=»possible nucleotide deletion, 
\=possible nucleotide insertion) 


5888 


375 


2302 


WiCRTPGVAMQRADSEQPSKRPRCDDSPRTPSNTPSAEADWSPG 
LEl^PDYOWGPEQVCSFLRRGGFEEPVLLKNIRENBITGAIiLP 
CLDESRPKNIX^SI^BRKKLLSYIQRLVQIHVDTMKVINDPIH 
CHIELHPI^WIIDTPQFQRLRYXKQIiSGGYiVFPGASHNRFKH 
SLGVGYIiAGCLVHAIiGEKQPEI^ISERDVLC^ 
PFSHMFDGRFI PLARPEVKWTIIEQGS VMMFEHL INSNG I KPVME 
Q YGLI PEED I CFIKBQ I VGPLESPVEDSLWPYKGRPENKS FLYE 
I VSNKRKFG I DVDKVTO YFARIX31HIiGI QNNFDYKRFI KFARVCEV 
D^IJR I CARDKE VGNI>YDMFHTRNSL^^ 

mFLKADDYIEITGAGGICKYRISTAIDDMEAyTKLTnNIFLEJL 
YSTDPKLKDAREILKQIEYRNLFKYVGETQPTGQIKIKREDYES 
LPKEVASAKPKVLLDVKLKAEDFIVDVTNMDYGMQBKNPIDHVS 
PYCKTAPWRAIRITKKQVSQIJiP\BKFAEQ\LIRWCKKVDRKS 
LYA\ARQYFVQW\CADR\NFT\KPQDGRCY* PPTP*HPQKKGW\ 
NDSTFS PKI PTRLPRRLPKSRV\QLFKDDPM 


5BB9 


1831 


731 


LPAACGRPVTARPRQAPJBGRSGRPRDL3PYPPQVFPPRPBRVAI 
VTGGTI^IGYSTAKHLARLGMHVI IAGNNDS KAKQ WSKI KEET 
LNDKET * VLLCCPGWLCIiWNSSDPPTSASRGAGTTGVHHHFIiLK 
FG I F I L \ DUVSMTS IRQFVQKFKHKKI PliiVL INNAGVMMVFQR 
KTRDGFBEHFGLNYEX>HFIiLTNI^ 

SATHYVAELiNMDDIjQSSACYS PHAAYAQSKLMjVLFTYHLQRLL 

AABGSHVTANVVDPGVVNTDLYKHVFWATRIAm 

DEGAVtTS rYAAVTPELEGVGGRYLYNKKBTKStHVTYNQKLQQQ 

LWSKSCEMTGVLDVTL 


5890 


1322 


200 


FRRGWSAAGRAVPYAFCSRISASSPRRPRGAVRLQSGTEAACRS 

GRPDPRPASAACKJHAGERMSQRimjVHLFAGGCGGTVGATLTCP 

LEVVlCTttiQSSSVTLYlSEVQUroiAG^VN^ 

I LB KEG PRS LFRGK5PNLVGVAPSRAI YFAAYSNCKEKIiND VFD 

PDSTQVHMISAAMAGFTAITATNPI WLIKTRLQL* /SQGTAGKR 

RMGAFECVRKVYQTDGLW3FYRGMSAS YAGI SETVIHFV I YES I 

KQKLLEYKTASTMENDEESVKEASDFVGMMLAAATSK\LVATTI 

AYPHEVVRTRI^EGTKYRSFFQTLSI^VQBEGYGSLYRGCbTTH 

L VRQ J P \NTAIMMAT YELWYHiNG 


5891 


1322 


200 


FRRGW3 AA0RAVP VAFC3RI SASSPRRPRGAVRIX5SGTBAACRS 
GRPD PRPASAAGGHAGKRMSQRDTLVHLFAGGCX3GTVGA ITiTCP 
LEVVKTRIiQSSSVTLYlSEVQLNTMAGAS VNRVVS PGPLHCLKV 
II^KEGPRSLFRGIXJPl^VGVAPSRAIYFAAYSNCXEKIiroVFD 
PDSTQVHMI SAAHAGFTAX TATNP IWI»1 KTRLQIj * /SQGTAGKR 
RMG^KCVRKVYQTIXa^PYRGMSASYAG I 
KQKLLBYTCTASTMENDEES VKEA3DFVGMM LAAATSK\LVATT I 
AYPREVVRTRLREEGTKYRSFFOTLSLLVQEEGYGSIjYRGLTTE 

lvrqi p \ntaimmatyblvvyllng 


5892 


1764 


379 


VVLRVGGRI*SVNSAVSSRTGGWSAGI/TCAM^^ 
DSGWPOJVAPCLSGAPHASAADVVVVHGRRX^ 
TPDKLLSAVWIAVLKDVNIJiPBQIXJDICVGNVLQPGAGAIMARI 
AQFLSD I PETVPLSTVNRQCSSGLQAVAS IAGGIRNGS YDIGMA 
CC VESMS IiADRQN PGN I TSRLW E KE KARDCL t PMG ITS ENVAER 
FGX SRE KQDTFALASQQKAARAQS KGCFOjAKrtnFVTn^VHDDKG 
TKRS riVTQDEGIRPSTTMEGIiAKLKPAFKKDGSTTAGNSSQVS 
DGAAAXLLARRS KAEELGLP 1I/J VIiRSYAWGVPPD I MG IGPAY 
AI PVALQKAGI.TVSDVDI FBINE \AFASGJ\AYCVEKI*RLPP*BG 
♦TPLE5GASGP *GHPLGLHWGHVQVITIjAQ* S *SARGKRAYRSGC 
PCAIGSWNGSPLPVFEYPWGT 


5893 


3 


1653 


ILSKRRCQKJUCTKEIJIAKKVAVIGAGVSGLISIjKCCVDEGLEPT 
CFERTED IGGVWRFKENVBDGRAS I YQSWTHTS KEMS CFSDFP 
MPEBFPNFLHNSKIJ^YFRIFAKKPDLLKYIQFUrTVI^VRKCP 
DFSSSGQWK^Am)SNGKEQSAVFnAVMVCSGHHII>PHIPLKSFP 
GMERFKGQYPHSRQY1CHPDGFEGKRI LVIGMGNLGSDIAVELSK 
NAAQVFISTRHGTWVMSRISEIXJYPTOSVFHTRFRSMLRNVLPR 
TAVK>WIEQQMNRWFNHENYGLEPQNKYIMKEPVLNDDVPSRLL 
CGAIKVKSTVKELTETSAI FEDGTVEENIDVI I FATGYS FS FPF 



395 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
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nucleotide 

location 

c?f* virfisoond i na 

to first 

air.ino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A= Alanine, C= Cysteine, D=d\spartic Acid, E= 
Glutamic Acid, F=Pnenylalanine, G=Glycine, 
H=*Histidine, I=Isoleucine, K=I»ysine, 
I*»Leucine, M-Methionine, N^Asparagine, 
P= Proline, Q=Glut amine, R=sArginina, 
S=Serine , T=Threonine , VwVal ine , 
W=»Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








I^DSLVKVKNNIWSLYKYIFPAHLUKSTLACIGIiIQPLGSIFPT 

AEbQARWVTRVFKGLCSLPSERTMMMDI IKRNEKRIDLFGESQS 

QTLQTNYVDYI^ELAIiEIGAKPDFCSLLFKDPKIAV 

SY* YRLVGPGQ WEGARNAI FTQKQRILKPLKTRALKDS SNFS V3 

FLIJCELGIiLAWVAFF\ CQLQWS 


S894 


174 


1673 


RYSPKKVLQNKBSSLKLGMATALVSAHSL^ 

RLRBLCQQWI^PETRTKEHILELLVLEQFLI I LPKBLQARVQEH 

HPESREDVVVVI^1^I^IJGETG(X2VDPDQPKKQM 

KGVQBOQVRHKCEVTKPBKKKGKETRIENGKIirVVTDSCGRVBS 

SGKI SEPMEAHNEGSNLERHQAKPBCEia EYKCSEREQRF IQHLD 

LIEHASTHTGKMiCBSDVCX^SSLTGHKKVIjS* erkviqc\hgv 

LGKAFQRS3Hl A VRHQKIHI^KPyQCNECX3KVFSGWGI 1 LEH^ 

IHTGEKPYbCIHCGKNFRRSSHI^RHQRIHSQEEPCBCKEOGKT 

FSQALtiTHHQRIHSHSKSHQCNECGKAFSliTSDIjIRHIIRIHTG 

BKPFKOilCQKAFRLNSHIjAQHyRlHNEEKPYQCSECGEAFRQR 

SGLFQHQRYHHKDKLA 


■"Saw 


2967 


8* 


HPSLLGAIPFYPPPSSPWPPPLYLFWKSHRJCSRHFXNQRGIHGB 
M^FVSDG\TCOJVLAAAGRARGRAEVI,ISC^PEDCVWFLT 
RPKVPVLQLDSGNYLFSTSAICRYFF\IiIiSGWEQDDIiTNQWriEW 
FATEI^PTI*SAAIaYYL\VVQGKKG\EDVlJGSVRRTLTHIDHSLS 
RQ\NCPFIiAGETKSIiAD I VbWGALYPIjLQDPAYLPKBLS ALHS w 
FQTI^TQ\EPCQR\AARRLVLKQ\QGVX i AI»R\PYLQKQPQPSPA 
EGKGLSP IEPEEEELATLSEBE IAMAVTAWBKGLESIiPPLRPQQ 
NPVLPVAGERKTVLITSALPYVNNVPHLGNI IGCVLSADVFARYS 
RLRQWNTLYIX}GTDEYGTATETKAL\EEGLTPQEICDKYHIIRA. 
DI Y\RWFNI SFDI FGRTTTPQQ\TKIT\QDI FQQUiKRGFVtiQD 
TVEQLRCEHCARF\LADRFVEGVGPFCGYBBARaiX^DKCGKLI 
NAVEI^PQCKVCRSCPVVQSSQII1>FLDLPKLEKRIjEEWLGRTL 

gff>dk\vfyvwfdatigy1>sitanytdqwerww \knpeqvdlyq 
fm\akdnvpfiislvfpssalgaednytl\vsrliateylny^ 
k\fsksrgvgvfrm4\ahdtgippdisrfyl\lyirpegk\dsa 
fswtdli^knns \bllnni^tpinra\gmfvskffgg \ yvpemv 

LTPDIX?RI^\HVTLF^04IYIIQ\LLEKVRIRDALRSIIiT I S\RH 
GNQYl\QVNEPH\KRlKGSFJ^QRAGTVTGrAVNIAAIiLSVML 
QPYMPTVSAT IQAQLQI*P P PACS I LbTNFLCT I>PAGHQIGTV5P 
LFQKIjF2JDQI ESLRCJIFGGGQAKTSPK^^ 

U*ff)EVTKQGN I VRELKAQKADKNE VAAWAKIJ^LKKCK*AVAEG 
KPPBAPKGKIQaC 


5B96 


2967 


86 


HPSIiGAJPFYPPPSSPWPPPLYLPWNSHRKSRHFIlSfQRGIHGB 
MRLFVSDGVPGCLPVIAAAGRARGRAEVI^STVGPBIXrVVPFLT 
RPKVPVLQI^GNYIiFSTSAICKyFFXrilSGWBQDDLTNQKLEH 
BATELQPTI>SAAIjYYL\ VVQGKKG \ EDVLGS VRRTLTH IDHS LS 
RQ\NCPFIiAGETBSIADIVLWGALYPLLQDPAYLPEELSAIiHSW 
FQTLSTQ\EPCX}R\AARRLVI J KQ\0<5VIALR\PYU3KQPQPSPA 
BGKGIiSPIEPEEEBLATLSEBBIAMAVTAWEKGIiESLPPIiRPQQ 

npvlpvagertm,itsalpyvn^phl^iigcvlsadvfarys 
ri^qwntlylcgtdeygtatetkal\ebgltpqeicdkyhi iha 

DIY\ RW FNI E FD I PGRTTTPQO\TKI T\QDIFQQLLKRGFVIiQD 
TVBQLRCEHCARP\ LADRFVEGVCPFCG YEEARGDQCDKCGKLI 
NAVELKKPQCKVCRSCPVV0SSQHIiFIJ)LPKr3KRI*EEWLGRTL 
PGSDWTPNAQFTTPKFGFREWPSKPRWQ*TRDLK\WGNPGTP*E 
GFEDKWFYVWFDATIGYLS ITANYTDQWERWH\ KNPEQVDLYQ 
FM\AKDNVPFHSLVppSSALGAEI»m^\VSHLIATEYlOT 
K\FSKSRGVGVFRDMV\HDTGIPPDISRFYI.\LYIRPEGK\DSA 
FSWTDI»LIiKNNS \BLLNNLGNF INRA\GMFV3KFFGG\YVPEMV 
LTPDI>QRIJUA\HVTLELQHYHQ\l^EKVRIRnALR5 ILTI S \RH 
GNQYI \QVNEPW\KRIKGSEADRQRAGTVTGrJVVNI AALLSVML 
QPYMPTVSATIQAQIjQLPPPACSILLTprFLCTLPAGHQlGTVS P 
LFQKIiENDQIESLRQRFGGGQAKTSPKPAVVBTVTTAKPQQIQA 



396 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 
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residue of 
amino acid 
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Predicted end 
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location 
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to first 
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residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D=5A3partic Acid, E=> 
Glutamic Acid, F^Phenylalanine, Glycine, 
H=Histidine, I«Isoleucine, K«Lysine, 
T.^Leucine, M^Methionine, N^Asparagine, 
P=Proline, Q=Glutamine f R^Arginine, 
S^Serine, T-Threonine, V-Valine, 
W=Tryptophan, Y-Tyrosine, X^Unknown, +=Stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 








LMDEVTKQGNIVREIjXAQKADKN^ 
KPPEAPKGKKKK 


5897 


29^7 


86 


HPSLLGAIPFYPPPSSPWPPPLYLPWNSHRKSRHFINQRGIHGB 
KIRLFVSI^PGCLPVLAAAGPJ«GRAEVI,ISTVGPEDCVVPFLT 
RPKVPVLQLDSGNYLFSTSAI CRYFF\LLSGWEQDDLTNQWLEW 
EATELQ PTLSAALYYL\ VVCK5KKG\KDVLGS VRRTLTHI DHS LS 
RQXNCTFLAGETF^IiADrVLWGAIiYPliLQDPAYIiPBBI^ALHSW 
FQTTjSTQ \ K POQR \ AARRLVLKQ\QGVI ALR \ FYLQKQPQPS PA 
EGKGLS P I B PEEEELATLSEEE I AMAVTAW EXGLESLPPLRPQQ 
NPVLPVAGKRNVL I TSALPYVNNVPHLGNI IGCVLSADVPARYS 
RLRQWNTLYL CGTDEYGTATETKAl\EEGLTPQE I CDKYHI IIIA 
DIY\RWFNISFDIFGRrTTPQQ\TKIT\QDrFQOJbLKRGFVLQD 
TVEQIjRCEHCARF\IiADRFVEGVCPFCGYKEARGDQC^ 

havelkkpqckvc^cpvvqssqhlfij^^ . 
pgsdwtpnaqfitpffgfrew pskprwq*trdlk\wgnpgtp * k 
gfe3k\ v fyvwfdatigyls i tanytdqwerww\knpeqvdlyq 
fm\akdnvp f1islwps s^vlgaednytl\vsh1>iateyij^yedg 
k\fsksrgvgvfrdm\ahotgippdistifyi,\lyirpbgk\dsa 
fs wtdij^lknnsxbllnndgnp inra\gmfvs kffgg \ yvpemv 
ltpddqpo^\hvtlbiiqhyhq\libkvriri>alrslltis\rh 
gnqy1 \qvnbpw\kri kgsfadrqragtvttkjvvni aallsvml 
qpymptvsattqaqlqlpppacs illtnfiictlpaghqi gtvs p 
lfqklbndq i kslrqrfgggqakts pkpawbtvttakp qqi qa 

LMD2VTKQGNI VRELKAQKAD KNEVAAEVAKT »I^LK KQLAVAEG 
KPPEAPKGKKKK 


5898 


2967 


86 


HPSIJ^AIPFYPPPSSPWPPPLYIiFWNSHPJCSRHFINQRGIHGE 
MPXFVSEKjVPGCLPVLAAAGRARGRAEVIiISTVGPE 
RPKVPVLQLDSGNYLFSTSAI CRYFF\LLSGWEQDDLTNQWLBW 
FATEIjQPTLSAALYYL\VV<^KKG\EDVI^SVRF^THIDHSI^ 

rq\ncpflagbtesladivlwgalypllqdpayi»peelsalhsw 
fqtlstq\epcqr\aapjilvlkq\qgvlalr\pylqkqpqpspa 

EGKGLSPI EPEEEBLATI>SEEEIAMAVTAWEKGLESLPPLRPCX) 

npvlpvagerjnvlitsalpyvnnvphlgni igcvlsadvparys 
rlrqwntlylcgtde ygtate^tkal \begltpqb icdkyiii iha 

DI Y\RWPNIS P1>IPX3RTTTPQQ\TKIT\QDI FQQLLKRGFVI^D 

tveqiircsha^\ladrfvbgvcpfc»ybeargdqcdkogkll 
navblkkpq ckvcrs cpvvqssqrifldlpklbkri^bwlgrtl 
pgsdwtpnaqfitpffgfrew ps rprwq * trdlk\wgupgtp * e 
gfbdk\vfyvn fdat igyls itanytdqwerww\kn peqvdlyq 
pm\aktjnvpfhslvfpssaii3aedhytl\vshlia^ 

K\FSKSRGVGVPRDM\AKO>roiPPDISRFYL\LYIRPEGK\DiSA 

fswtdi^knns\ellnnlgneinra\gmfvskffgg\yvpemv 
ltpddqpilavlivtlblqhyhqxllekwipjw^ i ltis\rh 

G»QYI\QVlffiPW\KRIKGSEADRQRAGWroiiAWIAAIJ^SVML 
QPYMPTVSATIQAQLQLPPPACS illtnflctlpageqigtvsp 
tiFQKLENDQlESIiRQRFGGGQAKTSPKPAVVErVTrAKPQQIQA 
IJTOEVTKQGNIVRELKAQICADKNEVAAEVA^ 
KPPEAPKGKKKK 


5899 


326 


1078 


NCPKS KB PNGVRAPSLP S Pli^AAMALSDVDVKKQ IKHMMAFI EQ 

EANEKAEBIBAKAEEEFHIEKjQRLVQTQRLKIMEYYBKKEKQIE 

O^KKIIjMSTMRNQARLKVLRAPJTOLISDIjIi^ 

EVYO^LLDKLVIiQGLLRLLBPVM i vrcrp\qdlllveaavqkai 

pbymtisqkhvbv\qidkba*lavbcswevwevysgnqrikvsn 

ti^srldlsakqkmpeiri^fgantnrkffi 


5900 


£4 


1469 


KAASRDSP CI^PCPLGGVSSHDI^HRMW^^ 

GGVIYPALPQPNFKSIJ>PIA\mWIIHTASKSLTCAW^ 

KVANTWteFDYVWI^HCRSASCYNSra^^ 

KTI RLDETTLFFTWPDGHVTKYDLNWr VKNS YEGQKQKVJQPRI 

LWNAE I YQQAQVPSVDCQS FLETNEGLKKFLONFLLYGI AFVEN 

VPPTQEHTEK1ABRISLIRETIYGRMWYFTSDFSRGDTAYTKLA 

LJIRIITDTTYFQEPCXSIQVFHCLKHBGTGGRT^ 
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Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, O Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, ^phenylalanine, G=Glycine, 
H»Histidine, I=lsoleucine, K=Lysine , 
L^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine f R»Arginine, 
S=Serine, T» Threonine, V=Valine, 
W-Tryptophan, y«*Tyrosine, X~Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QKAPEEFELbSKSAI \KHBYIEDVGBCHQPHDWDWAQS* 1STHG 
/YKELYLI RYNNYDRAVINTVPYDWHRWYTAHRTLTIELRRPE 
NKFWVKLKPGHVLPIDNWRVLHGRKCPTGTOQLCGCYTiTro 
NTARLLGLQA 


5901 




2121 


VA1BQTSI»KMMQAVGG^ARPTGBYICKQCI3WCYTSLDSFQT^ 

KTHLDTVL? KLTCPQCNKE FPNQESLLKHVTT HFM I TS TY Y I CE 

SO)KQPTSVDDLQKHI^DMHTFVFFRCTLCX]KVFDSK^SIQIiH^ 

\AVKHSNEKKVYRCrrSQJWDFRNKTDI<JIJIVKH^ 

CIPt^BSFGTEVFJLQClJITTHSKKYNCKFCSKAFHAiri^ 

REKHCVFBTKTPNCGTMGASEQVQKEBVEI^^ 

DGS EBDVDTSEPM YGCDI CGAAYTMETW1QNHQL1RDHN I RPGE S 

ATVKKKAELI KGNYK(^(^RTFFSENGIJ^I>Kmil^P VKh^fM 

CPIOGBRFPSLLTLTEHKVTHSKSLiyrGNC^rCKMPLQSEEBFI* 

EHCQ^^PDLRNSLTGF3^CWCMQTVTSTLELKXHGTFH^K2K , 1X1N 

OR a vnTTHPf^nHVOin i YKfLAS CIi KEFRS KODI^VKLDINGLPYGL 

CTVGCV^^KSASPGINVPPGTNRPGI/^NENI^IEGKGKVC-GL 

KTRCS * LATFKF* VLKVELPE PHPKPFHRGVS RPDSNS TQLKTP 

QV^PMPRISPSQSDEKKTYQCIKCX^MVFYNEWDIQVHVANHKID 

EGI^ECKLCSQTFDSPAKIXK^IEHSFEGMGGTFKCPVCFTV' 

FVQANKLQQH I FSAHGQEDKI YDCTTQCPQKFF FQTEI*0NHTMTQ 

HSS 


5902 


712 


209 


LKNRRRSRPS IRQSIGSTSVSRWLTSLFTYIiDHTADVQ* V*RBF 
IPLKPRQ*BD*MPQSWLHAV^DTLEEAFEQCAMAMFGYMTDTGT 
VE PLQTVKVBTQGDDLQS LLFHFLDE WL YKFSADE P F I P \ GWGE 
EFSI>S KHPQGTEVKAI TYSAMQVYNE EN PE VFVI I D I 


5903 


2106 


— =rp 

735 


lYPbfDQT DQT^At»PQIJ3fiT^5PPSnPSVIiI.PGDPOPI»OGRGt*PTT 

PALF7^LSAVIH3GAAS PMP PSGIiRIjIiPLLLP LLWI*I»VLTPGRPAA 

GI>3TCK^IDMELVKRKRIEAIRGQILSKIiRLASPPSQGEVPPGP 

LPBAVI^YNSTRDRYAGESABPEPEPEADiYAKEVTRVLMVET 

HNEIYDKFKQSTHSIYMFFNTSELREA\fl?EPVl»LSR 

KLKYEQHVELYQKYSNNSVTOYI^NPJjLAPSDSPEWIiSPDVTGVV 

RQWI^RGGEIEGFRI>SAHCSCDSRW*TIjQVDINGFTTGR\RGDL 

ATlHGMNRPFLIJ^MATPLEPJVQHIjQS\SRHRQAL\iynTY 

HGGRNCLRC/VJIC*HI,IFRKDL\GW\KWI\HE\PKBYHANFC\L 

G PCPYI WS LDTQ YSKVLALYNQ \HKPG\ A3AAP \ CCVPQALEP \ 

LPIVYY\VGRKPKVEQLSNMTVRSCKCS 


5904 


3 


1126 


MhO^IEl^Il^FKEEQRLIYEEIjIKEBKTTNNEIiSAISRKID'rW 
AUGNSETE HAFRAI S S KVPVDKVTPSTLPBEVLD FBKFLQQTGG 
RQGAWDDYDHQNFVKVP^nCHKGKPTFMEEVLEin^PGKTQDEVQQ 
HEKWYQKFLALEERKKES IQIWKTKKQQKRBEIFKLKEKADNTP 
VLFHNKQEDNQKQKEEQRKKQKLAVRAWKKQKS I BMSMKCASQL 
KEEEEKEKKHQKERQP^FKLKLLIiESYTX^KKEQEEFLRIiEKEI 
RBKAEKAEKRKNAADE ISRFQERDLHXLBLKI LDRQAKEDE KSQ 
KQRRIiAKLKEKVENNVS RDPS RljY/irraQR 
ATSTYPT*GYSNLBTRMTBKSMR 


5905 


2B7 


2912 


MAS FPPRVWEKfi tVRXRTlGEIdtiA^AAPFDKKCGRENWTVAP AP 
DGSYFAWS^JGHRTVKLVPWSQCTiQNFLtHGT 
QNSDGGQKNKPREHI IDCGDI VWSIAFGSSVPEKQSRCVNIEWH 
RPR FGQIX}IJ^I»ATGLNSGRI KI WDVYTGKXJJJiLVDHTGVVRDL 
TFAPDGSLILVSASRDKIIJIVWDI^DDG^ 
SCAFSPDSSMLCSVGASKAWAAILV* IjRLCWHHSHTGATMVLS 
WAERVASIATCIX5ATFTIG*SNLAFVU^VLYVHRCWSMSTFCF 
SFFLFFFPKVISPTVKYH* LLSKLI FQFYGIGSLTSETNLM * S I 
WLSNGFSVLFFGII*SDSRDILRX.*PNLKFVLIFF*K* CIVSVQK 
KKKPKR IALLQEERLS *DKPPSSHI»I *QTBVNIRI LFRAILHS * 
I»LIFRI*NCI *TYS *IIDPFYIQMTYDRG*FGKNKMVKF*FIEM 
*LYYFKXI^SFClWr*HPCCLPKiCFHIiAVNILFACSICFSS*A 
QVGDPSI^*TSDYLKGRCQWSNNLLTLRF1^VYFFKNLVVSGKK 
REGGL* YLTLFISVYFS* LVFGlNGFQYSFVVKIiHCI>YFMFRiiI 
FKLTFNRMI*NRICMSALINIaKTDI^LTMT^ 
YN1^*I*QF*Y1CMCHFVLCMSE*SYNI<^^ 
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